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METHODS OFDIAGNOSISOFBKEASTCANCER, COMPOSITIONS 

ANTO^VBETHODS OF SCREENING FOR MODULATORS OF BREAST 

CANCER 



CROSS-REFERENCES TO RELATED APPLICATIONS 
Ms application claims priori* to USSN 60/263,965, fifcd 

60/294,443, filed May 29, 2001, all of which are incorporated herem hy 



entirety. 



FIELD OF THE INVENTION 
Theinvention^^ 

toufying and using agents and/or targets .ha, inlubhbreas, carreer. 

BACKGROUND OF THE INVENTION 
Bceas.^cecison.ofU.amos.ft^ueo.lydiagnoscdcanc^andmeaecoad 

, ,• ^offemalccanccrdeathinNorthArncricaandnorth.n.E.ropc.w.thlung 
leading cause of female cane Jkease in the United States is one- 

k !„,, th. leadinit cause. Lifetime incidence of the disease in me u 
cancer being the leaning can detection of breast 

• h, „i,h a l.m-291ifetimeriskofdyingfrombrea S lcancer.Earlydetect 1 ono 

weight, wtha 291 ^ „ ^ ^ examinalion, haa 

cancer, nsing mammography, ctacal br ^ ^ ^ 

^manually improved the — of the Treatment of breast 

as mammographic sensitivity has been esnmated at only 60/C-90/.. 

^^Urgely.fanrgicnln^eo.omy.rn^^y.ramafionmempy.ama. 
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hormone tlierapy.and/orchemotherapy.Althoughmany breast cancer patients are effect.vely 
treated, the current therapies can all induce serious side effects which diminish quality of hfe. 
Deciding on a particular course of treatment is typically based on a variety of prognose 
parameters and markers (Fitzgibbons et al., 2000, Arch. Pathol. Lab. Med. 124:966-978; 
Hamilton and Piccart, 2000, Ann. Oncol. 11:647-663), including genetic predisposUon 
markers BRCA-1 and BRCA-2 (Robson, 2000, J. Clin. Oncol. 18:1 13su P -l 1 8su P ). 

Imaging of breast cancer for diagnosis has been problematic and limited. In 
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an 
important prognostic factor; five year survival rates drop from 80 percent in patients with no 
lymph node metastases to 45 to 50 percent in those patients who do have lymph node 
metastases. A recent report showed that micrometastases can be detected from lymph nodes 
using reverse transcriptase-PCR methods based on the presence of mRNA for 
careinoembryonic antigen, which has previously been shown to be present in the vast 
majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med. 

339(4):223 (1998). 

The identification of novel therapeutic targets and diagnostic markers is 
essential for improving the current treatment of breast cancer patients. Recent advances in 
molecular medicine have increased the interest in tumor-specific cell surface antigens that 
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens 
suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and 
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell 
antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herce P tin®/trastu 2 umab) are currently in use for the treatment of metastatic breast cancer 
(Ross arid Fletcher, 1998, Stem Cells 16:413-428). Similarly, anti-CD20 monoclonal 
antibodies (Rituxin®/rituximab) are used to effectively treat non-Hodgekin's lymphoma 
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol. 
10:548-551). 

Other potential immunotherapeutic targets have been identified for breast 
cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane 
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^genicnUy distinc, — and is in «fr chnica. trials as «— « ^ 
d 2000 Clin Canc e rRe,6:1693-1701 ; SohoUe,al.,2000,I.Innnun»ther. 23.570 580). 

ZlJcA .5-3 (Bon e, ,, .997, CU, Chero. 43:585-593, *^££T 

^ «o be overexpressed in breas, cancer, sneK as EOFR (Samsbury « aM WUnce, 
U L47,:1398.1402,,e-e r bB3( N *e.a 1 ,.9 8 8 > Br.,C^78: 1 3854390)FG F R2 

(P^U-Uoeea e, a,., .99., bat 3. Career 61 :.70-176,, « « 

brc ast eanee, associated 6 cnc . (Kurt et al, 2000, Breas, Cancer Res. M 59:41-48). 
AKbough monoclonal antibodies .o the protein products of sotne of these overexpressed 
, g enesba,obeon I eported(fo,review,seoOroe„e,a 1 .,2000,CancerT re atRev.26.269.286), 

nonearocurrentlyapprovedforbreastcancortherapyinthcUS. 

Disclosures of certain genes and ESTs described as being expressed m breast 

97/02280 and WO-00/55173, WO-98/45328 and WO-00/22130. Similarly, genes and KTs 
„ d e OT ^eda S bo m goxp re ss^mb re astoano=rarodisolosedinUSPa K n,Nos.5,759,776and 

5,ti93,522. The utility of such genes is described in each of these poblications, and then 
disclosures are incorporated herein in their entirety. 

Whilo industry and acadomia have identified novel sequences, there has not 
been an equal effort exerted to identify the function of those novel sequences^ 
,5 elucidation of a role for novel protein, and compounds in disease states for tdenhflcahon of 
' orerapoudc targets and diagnostic markers is essentia, for improving the current treatment of 
bre , st cancer patient, According.,, provided herein are molecular targets for therapeutic 
intervention in breas, and other cancer, Additionally, provided herein are methods mat can 
be us* in diagnosis and prognosis of broas, cancer. Further provided are methods mat can 
30 be used to screen candidate bioactive agents for the ability to modulate breast cancer. 
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SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that 
areup-anddown-regulatedinbreastcancercell, Such genes are useful for diagnosuc 

5 ptoses, and also as t^^ , 
cancer, such as hormones or antibod.es. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a breast 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
10 biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the present invention provides a method of determining 
the level of abreast cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
15 breast cancer-associated transcript in a cell from a patient, the method comprising contacting 
a biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 

least 95% identical to a sequence as shown in Tables 1-25. 
20 m one embodiment, the biological sample is a tissue sample. In another 

embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 
25 i n one embodiment, the patient is undergoing a therapeutic regimen to treat 

breast cancer. In another embodiment, the patient is suspected of having metastatic breast 
cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer associated transcript is mRNA. 
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10 I na f uA eI embod i m e n^epa tt entha S .d rag re sisM fonnofbreastoan«n 

to o™ embodiment, the method farther comprises the step of: (ft) companng 
to .eve. of Ore breast eancer-associattd transcript to a level of*, breast cancer-assoc.ated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeuttc 

treatment. «..«■„ 
15 Additionally, provided herein is a method of evaluating the effect of a 

candidate breast cancer drug comprising administering the drug to a patient and removing a 

cell sample from the patient. The expression profile of the cell is then determined. Tins 

m ethod may further comprise comparing the expression profile to an expression profile of a 

healthy individual. In a preferred embodiment, said expression profile includes a gene of 

20 Tables 1-25. - 
In one aspect, the present invention provides an isolated nucleic acid molecule 

consisting of a polynucleotide sequence as shown in Tables 1-25. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 

acid • 
25 ' m one aspect, the present invention provides an isolated polypeptide which is 

encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-25. 

In another aspect, the present invention provides an antibody that specifically 
binds to an isolated polypeptide which is encoded by a nucleic acid molecule havmg 
polynucleotide sequence as shown in Tables 1-25. 



5 




WO 02/059377 PCT/US02/02242 



In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 
5 In one aspect, the present invention provides a method of detecting a breast 

cancer cell in a biological sample from a patient, the method comprising contacting the 
biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to breast cancer in a patient, the method comprising contacting a 
1 0 biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a 
sequence from Tables 1-25. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a breast cancer-associated polypeptide, the method comprising the 
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the 
1 5 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional 
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect. 
20 In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 

cell membrane. In another embodiment, the polypeptide is recombinant. 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
25 proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method 
comprising the step of administering to the subject a therapeutically effective amount of a 
compound identified as described herein. 

In one embodiment, the compound is an antibody. 
In another aspect, the present invention provides a drug screening assay 
30 comprising the steps ofr(i) administering a test compound to a mammal having breast cancer 
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15 



the level of expression of toe polynuclaoud. 18 a candtdat. for tn.tr 

toooeembodiroeo.^econ.rolisamasrrrna.wi.hbreas.oanoeroracn 

the control is a normal cell or mammal. 

■nonceM^entto^^ooMis^^inva^araountsor 

conceal. In another embodiment the - compound is administered for varying 

hybridize .0 a sequence at leas. 80% identical to a sequence as shown in Tablea 1-25 are 
individnaUy compared to their respective levels in a control cel. sample or mamma,. In a 
preferred embodiment the plurality of polynucleotides is from three to ton. 

to another aspect, the present invention providca a method for treating a 
m anrn,a,,.vm g b reM .o m oerc.mprisi„ g adminis.e ri n g acomp.andidentifredhymoas S ay 

described herein. 

In another aspect, the present invention provides a pharmaceutical 
composition for treating a mammal having breast cancer, the composition compnsmg a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

to one aspect, the present invention provides a method of soreemng drug 
, candidates by providing a eel, expressing a gene that is up- and down-regulated as in a breast 
oancer. to one embodiment, a gen. is selected from Tables 1-25. Th. method further 

on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes 
comparing the level of expression in the absence of the drug candidate to the level of 



30 
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a\a*» wherein the concentration of the drug 

~ "* ^::T'Z plml embodiment, the ceO expresses - « - 
removal of the drug candidate. In a prelerrea em 

, — =:=rr,=.=.— 

dregoomprismgadlste ringthedregtoabansgenicanima! expressmgorover-expressmg 

protein, for example as a result of a gene knockout. „ m0 ,eic acid 

Moreover, provided herein is a biochip comprismg one or more nucletc aotd 
, of Tables 1-25 wherein the bioohip comprises fewer than 1000 nuoleic aotd probes. 
'° rS- — — Morepreferably.atleastmaee 

is provided. The method comprises determining .he expression of a gen. of Tables 1-25, 
, 5 preferably a gene of Tab.e 25, in a tat tissue We of a ft* individual, and company the 
distribution to the expression of the gene from a second notmal tissue type from me firs, 
mdividua! or a second unaffected individual. A difference in mo expression mdtca.es that the 
fimtindividualhasadisorderassooialedwitobreaslcanoer. 

to a further embodiment, the biochip also includes a polynucleoude sequence 
20 of a gene that is not up- and down-regulated in breast cancer. 

to one embodiment a method for screening for a bioactiv. agent capable of 

pr o,ein)oranagmentmeroofandana„,ibodywhiohbmds.o S aidb,ea S «canoermoul.tory 
leinor fragment thereof, to a preferred embodiment the method compnses combtnmg a 

25 L.«-~^-~^ , ^^r , rL" 

mti „ody which binds to said breast cancer modulatory protein or fragment thereof The 

fragment thereof and said antibody. Wherein mere is a change in binding, an agent ts 
id.nL aa an interfering agent. Th. interfering agent an b. an agonist or an antagotust. 
30 Preferably, the ag«nt inhibits breast cancer. 
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Mso provided hereto are methods of ottoittog an immune response to an 
W ividuaL to one emhodimen, a method provided herein eomprisea 
: ividna, a edition eomprisins a toe*, oancer m^Uns peotenw a hapnen, 
tto eof. to another embodiment, toe pronto is enecded by a nn ol e,c ae,d aeleeted fi om thoae 

5 of Tables 1-25. ■ ^..^ 

Further provided herein are compositions capable of ehcrung an nnmune 

responsemanmdividual^ 

Jastcancermodulatingprotein, preferably encodedbyanucleicacidofTab es l-25,mo, 
preferably ofTab^S.orafragment thereof, and a pharmaceutically acceptable earner. In 

10 anotherembodiment^^^ 

encodingabreast cancer modulating protein, preferably selected from the nucle, acxds of 

Tables 1-25, and a pharmaceutically acceptable carrier. 

Also provided are methods of neutralizing the effect of a breast cancer protem, 

or a fragment thereof,^ . 
15 protein in an amount sufficient to effect neutralization. In another embodiment, the protem ts 

encoded by a nucleic acid selected from those- of Tables 1-25. 

In another aspect of the invention, a method of treating an individual for breast 

cancer is provided. In one embodiment, the method comprises administering to said 

individual an inhibitor of a breast cancer modulating protein. In another embodunent, the 
20 method comprises administering to apatient having breast cancer an antibody to abreast 

c'ancermodulatingprotein conjugated to a therapeutic moiety. Such a therapeutic moiety can 

be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE INVENTION 
25 In accordance with the objects outlined above, the present invention provides 

novel methods for diagnosis and prognosis evaluation for breast cancer (PC), including 
met astatic breast cancer, as well as methods for screening for compositions which modulate 
breast cancer. Also provided are methods for treating breast cancer. 



30 



Tables 1-24B provide unigene cluster identification numbers for the 
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. ,n 11 10 n 18 19 20 21, and 22 list those genes that are 
that is part of the unigene cluster. 

Defl " i "° nS Tta «^^^-^^'*- ri - ,-fcr -' w 

,„ transcrip." refers « nnOeio - and po.ypep«de polymorphic *~ 

.ban about 60% nudeotide sconce IM 65%, 70%, 75%, 80%, 85%, 90/, 

preferably 91%, 92%, 93%, 94%, 95%, 96%. 97%, 98% or 99% 0, greater nucfcoUd, 
L qu a OMitoat ,,pr.f«-yo«.^nofovexa I eg i on.fa tte .Aou,25, 5 0. 1 00 200, 

,5 500 W.ormorenucleoUdea.toanuctotidasaquenceofor.ssociatedwthageneof 

Tables 1-25; (2) bind to antibodies, «*, polyolonal andbod.es, raised again* an immunogen 
con^sing^^-dao^enooen^-byannn^udes^ofor^ed^a 

ofTables 1-25, and conservafive.y modified variants thereof; (3) apeoifically hybndrze 
under sbingen. hybridization conditions to a nuoleio aoid sequence, or the complement 

sequence that has greater than about 60% amino aoid sequence identity, 65 A 70 A, 75 /, 
80%, 85%, 90%, preferabiy 9.X 92%, 93%, 94V.. 95%, 96%, 97%, 98% or 99% or greater 
amine sequenoe identity, preferably over a region of over a region of a. leas, abou, 25, 50 
100 200, 500, 1000, or more amino aoid, to an amino aoid sequence encoded by a nucleobde 
25 aequenceoforasaooialedwilhageneofTablesl-25. A polynucleotide or polypephde 
sequence is typrcall, ftom a mammal including, bn, not limited to, pnmato, o.g, human; 
rodent, e.g., ra,. mouse, hamster, cow, pig, horse, sheep, or other mammal. A "breast cancer 
po.ypepude" and a "breast cancer po.ynueleo.ide," include both nan.ra.ly ooeumng or 
recombinant forms. 
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,^ nn t thereof that contains all of the elements 
oolvpeptide or polynucleotide sequence, or a variant thereot, 

polynucleotide or polypeptide sequences. The full length y 

LTs of post-translation processing or splicing, including alternative splicing. 

g ^ogicalsan^ 
contains nucleic acids or polypeptides, e.g., of a breast cancer « 
transcript Such samples include, but are not limited to, tissue elated from pnmates,, ., 

bl ood,plasma,serum,s P utum,stooUears,mucus,hair, sk in,etc. Biological samples also 
include explantsandprimary and/or ^fonnedceUcultures derived from pahent tissue, A 
biological sample is typically obtained from a eukaryotic organism, most preferably a 
mammal such as a primate e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., gumea 
pig rat, mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by perfonmng the 
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 

be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 

95 o/ 0 96 o /o 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection ,g., NCBI 
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 

11 
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be ly idenUoar This definition *> ^ * or may be «^ 

aigolms oao aocoon. for gaps and to fit. Preferably, idennfy ex,a.s over a regron <ha. . 

anfino acids or nuoieondes in lengfir, ornrorepreferafiiy over aregroo firaUs 

50-100 amino acids or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
towhichtestsequencesarecompared. When using a sequence comparison algonthm, .test 

if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algonthm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 
5 A "comparison window", as used herein, includes reference to a segment of 

one of the number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
20 forcomparisonarewell-knownintheart Optimal alignment of sequences for companson 

canbe conducts e.g.^ , 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat I 
AcadSci. USA 85:2444 (1988), by computerized implementations of these algorithms 
25 (GAP BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspection {see, e.g.. Current Protocols in Molecular Biology (Ausubel et al, eds. 

1995 supplement)). 

Preferred examples of algorithms that are suitable for deterrnining percent 
30 sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 

12 
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20 



a,, J. MM. Bio,. 215:^0 (1990). BLAST and BLAST M »«* 
the invention. Software for performing BLAST analyses 

ateorithm involves firs, identifying high scoring sequence pars (HSPs) by .denting 

referred to as the neighborhood word snore rhreshold (Altschtd m aL, supra). Thes urrha 
.hlThewordhi.sareox.endedinhotird^tionsa.ongeaehseanenoofora.faraa^o 
cumulauveahgnmen, aeoreoanhe inched. thrive ™c—usmg.e.g, 

hits in each drreotion are halted when, the eomulative alignment score falls off by tire 
qua mrtyXmam its maximum achieved value; tire eumurative score goes to zero or below 

either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity »dspeedof m eatignment.TheBLASTNpm 6 ram(forn«c 1 eotide S eonence S ) 
oses as defaults a wordlength (W) of 1 1 , an expectation (E) of 10, M-5, N=-4 and a 
eomparisonofbothstratds. For amino acid sequences, the BLASTP program uses aa 

50 expeetation(E)oflO,M-5,N-4,anda«.mpari S onofbomstrand S . 

The BLAST algorithm also p«forms a statistical analysis of to stmrlartty 
fcetween two sequences (see, , g .. KarUn * A»tt »~ ** « ™ 90 ^ 
57 87 (1993)) One measure of similarity provided by the BLAST algonthm rs me amtdles. 
smP tob«(T OT ),wmehpmvtdesanmdieationo,meprobabUi W bywmebam»«cb 

30 ^eenwonnelentideortmtmoaotdsequenceswomdocenrbyehattee. Forexamp.e,a 
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r „ cAniipnce if the smallest sum probability in a 

el arg el g ativ e nu mb e r s ) e. g . > 5,10,20 ) 30,40,40 > 70 )9 0 j n0,150,17,etc. 

Aaindicationthattwonucleicacidsequeacesorpolypept^ 
" identical is that the polypeptide encoded by the first nucleic acid is —logically cross 
active withmeantibodiesraised against the polypeptide encoded by the second nucle, 

^sdescribedbelow. fc .p««-— 

polypeptide, e.g., where the two peptides differ only by conservative subsUtuhons. An the 

ortheir consents hybridize to each other under strh^gent condiUons, as descnbed be w^ 

same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
15 expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphxbran, or 
nrammalian cells such as CHO, HeLa, and the like (see, e.g.. the American Type Culture 
Collection catalog or web site, www.atcc.org). 
20 The terms "isolated," "purified;" or "biologically pure" refer to matenal that rs 

substantially or essentially free from components that normally accompany it as found m Us 
native state. Purity and homogeneity are typically determined using analytical chenustry 
techniques such as polyacrylamide gel electrophoresis or high performance hqu,d 
chromatography. A protein or nucleic acid that is the predominant species present m a 
25 preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protem 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acrd 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 
30 most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 

14 
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The terms "polypeptide, peptide ana P r ;H ™ivmers 
•j ^,.»o Th^ terms aoolY to amino acid polymers 

c„„ la ™8modifl^^» M . m dno».>^yocc«mng m «dpol!mi a . 

wdl as amino acid -ogs and anino acid Ratios tha. taction W <« *e na.ma.ly 

a5mell asto S aandnoacids.ha. a rela, e rmodaed, e .g,h,drOKyprohn e , 1 ,- 
^gmtamate, and O-phosphoserine. Amino acid analogs refers te abounds *., ha « 

to »d,oahy*og.n,acarboxy,group, M amino group, andanRgroup, e. 8 ,homos™e, 
norieucine, melhionioe sulfoxide, methionine methyl sulfonium. Such analogs may have 

Cemical structure as a naturally occunang amino acid. Amino acid mhnetics refers to 
chemical compounds .ha. have a struchue ma. is differen. from me general chemaca. 
^mre of an amino acid, bu, ft* functions similarly » a naturally ocouning am.no *± 
Amino acida may be referred to herein by either their commonly known three 

Nomenc.ah.re Commission. Nucleotides, tikewise, may bo referred to by men commonly 

accepted single-letter codes. 

"Conservatively modined variants" applies to bom amino ao.d and nuctoc 
; ^sequences. Wim respect to particula I nnclrioaoidse,uen« S ,oonsen.arive.ymodrSe4 
„W refers to .hose nucleic acids which encod* identical or esse.Ua.ly identical ammo 
acid sequences, or where me nuoleic acid does not encode an amino acid sequence, to 
e^nhally idenUcal or associated, e.g., naturally contiguous, sequences. Becanso o the 
aaaeneraoy of me genetic code, a large number of functionally identical nuole.c acds encode 
0 mos.pro.eins. For instance, the codons GCA,GCC,GCGand GCU all encode the ammo 

1<; 
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be M .o another of the cording codons described «** *"»«» "» ** 

be Idtfied .0 yield a funcfionaUy idenuca! Accordingly, often 

I nLc - Ik encodes a polypeptide is implicit in a described sequence w,ft, -pee, ,o 

.beoxprossionpiodoot.bo.notwithrespeo.toac.ualpiobesequences. 

As to amino acid sequences, on. of skill will recognize that rndmdual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or proton 
sequence which alters, adds or deletes a sing.e amino acid or a small percentage of ammo 
acids in the encoded sequence is a "conservativCy modified variant" where the al.erat.on 
results in the substitution of an amino acid with a chemically similar ammo mi. 
Conservative substitution tables providing functionally similar amino acids are well known m 
,he art Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies bomologs, and alleles of the inve„.ion. W .cally 
conservative substitutions for one another: 1) Alanine (A). Glycine «* 2) AsparUc acrd (D), 
Glutamic acid (E); 3) Aspamgiue (N), Glutamine (Q); 4) Argmine (R), Lysme (K); 5) 
leucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosme (Y), 
Tryptophan (W); 7) Serine (S), Threonine CD; and 8, Cysteine (C), Methienme (M) (see. 

eg.. Creighton, Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be desenbed in 
term s of various levels of organization. For a general discussion of this organization, 
eg . A^risetal.,MolecularBiologyoftheCell(3^, 1994) and Cantor & Solummel, 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
.Trimary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, thre* dimensional structures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that 
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often fern, a compact - of ft. polypeptide -d « typicaUy 25 to approximate ly 500 
al acids .ong Typica. domains a, -e „p of socrions o f .esscr orga— sue* as 
sketches of p-shee. and a-hehces. ternary sbucfure" ^- * * -pfefe mrea 
dimensions, sbuctore of a pofypepUde monomer. «Quatero*y structure 

units Anisotropic terms are also known as energy terms. 

-Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatrcal 
used herein means * lens, two nucieorides covaienfly .inked togeUrer. 
Oiigon—are fypioaUy *om soon, 5, 6, 7, 8, 9, f0, .2, .5, 25, 30, 40, Oormore 
nnCeo.idesrn.engOr.np.oaloonf.OOnnCeo.ide.inlengn,. N„c,eic acids and 

"000,2000, 3000, 5000, 7000, 10,000. etc. Anncleie acid offhe present invemronwd. 
generrd.y contain phosphodiester bonds, anhough in some oases, nnoleio add analogs are 
included durt may have alternate backbones, comprising, e.g., phosphoramrdate, 
phosphorothioate, phoaphorodimroafo, or 0-memy.phophoroamidi.e Hnkages (sea Extern, 
Ongonuc.eotides rmd Antdoguas: A Practica. Approach, Oxford University Prcss); and 
peptide nucleic acid backbones and .inxages. Other analog nucleic acids inofudc those wrm 
positivebackbones; non-ionio backbones, andnon-ribose backbones, inoludingfhos. 
described m U.S. Pafen. Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 
Symposium Series 580, Sm f "* 
Cook ed,. s^.*.-^-"™-^^"*^^ 

fcr avarietyofreasons, e.g. .0 increase the-ilrty andhaff-life of auoh mofecuiea m 
physiofogioa. environment, or aa probes on a biochip. Mixtures of nafumuy occurnng 
nuefcic acids and analogs can be made; alternatively, mixtures of different nucletc actd 
analogs and mixtures of naturally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic acid analogs, including, for 

therein; Leraingcr, J. Org. Cham. 35:3800 (1970); Sprinzl e, ah, Eur. I. Brochem. 81:579 
30 (.977); linger e, al., Nucl. Acids Rea. .4:3487 (.986); Sawai a, a!, Chem. Led. 805 
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Ju.S. Paten.No. 5.644,04*), phosphoric* (Briu « a.., ,. Am. Chen, ^ 

31 1008 (1992); Nieboo, Nature, 365:566 (1993), Carlsson 01 al,»a 
otwhi ch«inco^byref^).O te ^gn-.oioaoid S uKWoftoso» ; « l 

backbones CM. Pa«eo.Nos. 5,386,023, 5,637.684, 5,602,240, 5,2.6,14, and ,69,86, 
Kiedr.ws.ri el aL. Angew. Chen, M. Ed. English 30:423 (.991)1 ^.ngo, o, M L Are. 
Ch.reSoo.llO:4470(.988)',Letsing.r.lal.,Naoleoside&Nuolootido 13:1597(1994); 
Chapters 2 and 3, ASC Syreposinre Series 580, Xa*ohydra,e Modifications in Antrsense 
Rcsearoh", Ed. Y.S. Sanghni and P. Dan Cook; Mesnraoker e. al., Bioorganio * Md-I 
Cbem Let,. 4:395 (1994); Jeffs e, aL, J. Biomoleeular NMR 34:17 (1994); Tetrahedron Lett 
37-743 ( 1996)) and non-ribose backbones, inoloding those described in LIS. Patent Nos. 
5 235 033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580. "Carbohydrate 
Mediations in Anriscuse Research-, Ed. Y.S. Sanghni and P. Dan Cook. Nucleic ac* 
containing one or more careocyclic sugars are also included within one definition of una ere 
acidstsee Jenkins eta... Chen, Son Rov. (.995) op 169-176). Several nndeic aord ana.ogs 
are described in Raw.s, C & B News June 2, .997 page35. All of these references are horehy 

expressly incorporated by reference. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. TT.ese backbones are substantially non-ionic under neutral coupons, m 
contrast to the highly charged phosphodiester backbone of naturally occurring nucle.cac.d, 
This results in two advantages. First, the PNA backbone exhibits improved hybridan 
kinetics. PNAs have larger changes in the melting temperature (TJ for mismatched versus 
perfectlymatchedbasepairs. DNA and PNA typically exhibit a 2-4°C drop in T m for an 
Lemalmismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Smularly, 
30 due to their non-ionic nature, hybridization of the bases attached to these backbones is 
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complementary strand; thus the sequence 

tlB sequence. The - may be DNA, bo* - 

wheretoenucleieaeidmaycontamoombinat.onsofdeoxynbo-^ntoa 

^nebypoxantbin 

WeoLe" indudes nucleotides and nudeoside and nucleotide analoga, and nmduied 
ZZZ - ea amino modifted nuclides. In addition, -de" tne,udes nor, 

aoid.e.ehcon^iningatee.areref.Ted.ohe.inaaanudeos* 

A "Ubel" or a "detectable moiety ia a composition detectable by 

« a g., as commonly used in an BUS A), biotin, digoxigenin, or bapten, and pro ems 

^Igtitosemennodadesoribedby Hunter e.a.^lM^ (.96 l Dav.de, aX 
, ^0.4 C>,74>; Pain e, at, «*» < 198 » ; 

An "effector" or "effector moiety or ettector comy^ 

30 t -rtJr oan be a varies of mo.t.u.ea inoWing, e.g, detection mo,ebes tncludmg 
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radioisotope emitting "hard" e.g., beta radiation. _ ., her 

A "labeled nucleic acid probe or oligonucleotide is one that is bound, either 

Waals electtostatic, or hydrogen bonds to a label such ma, the presence of the probe may 

by detecting the pteseneo of the Ube, bound to die probe. AUemattv* . 
uamgmghafmiirym.eraottonaniayaohieve.heaame^whereoneofap.cofbniding 

partners binds to the other, e.g.,biotin, streptavidin. . A ..,. a,„ 

As used herein a "nucleic acid probe or oligonucleotide" ta defined as a 

nucleic aoidoapableofbinmngtoatarge^^ 

one or more types ofchemical bonds, usually enough complementary baae panmg, usuahy 
through hydrogen bond formation. As used herein, a probe may include natural fre., A, O, C, 
„r T) or modified bases p-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodieater bond, so long as it docs not 
functionally interfere with hybridization. Thus, =.g., probes may bo peptide nucleic actds m 
which die constituent bases are joined by peptide bonds rather than phosphodiester linkages. 
It will be understood by one of skill in tire art drat probes may bind targe, sequences lacking 
complete complementarity with the probe sequence depending upon the stringency of.be 
hybridization conditions. The probes are preferably directly labeled aawith isotopes 

streptavidin complex may later bind. By assaying for the presence or absence of the probe, 
one crut de.ee, tire presence or absence of the select sequence or subsequence. Dtagnosrs or 
prognosis may be based a, fine genomic .eve,, or a, fine level of RNA or protein express,™. 

The term "recombinant" when used with reference, e.g„ to a cell, or nuclerc 
acid ptotein, or vector, indicates that ft. cel., nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the allerafon of a 
native nucleic acid or protein, or mat the cell is derived from a cell so modified. Thus, eg., 
recombinant cells express genes ma, are no, found within me native (non-reoombinan.) form 
of me eel, o, expmss native genes that are otherwise abnormally expressed, under expressed 
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, • herein is meant nucleic acid, 

originally fotm«d«v«, in general by dinnatllte . to this manner, 

rt — anri • "-^J^. , nnoleic acid, in a linear 

operaUy linkage of different ***** > . ^ „ „ act 

..will replicate non-recontbntanny, ..... ns, g 
KostceU^*-.- ^ stillconsidered 

^-^^^^^^^^^orrionsofan^ 

environmental and developmental condations. An mau P 

active under environmental or developmental regulauon. The term operably fa* 

3 0 ^^^1^.--- — ^■*- <N *" m 
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tntfl! cellular or library DNA or RNA). 

Xha^^hybridi^onco^-^tooo^unde. 

whichaprobewiU hybridise to its targe, subsequence, typicaUy in. complex n»x«e of 
nuc ,eic a kbu..ono ounces. Stfngen. Condons aae sequence^ en and 
wiU be differen. iu ditferen. circumsunces. Longer sequences hybridize specially at 
higher temper.^ Mearensiveguide^rhahybridiaarionofnucWc acids ,s found™ 

toe rarge, sequences arc present in excess, a. T„ 50% of the probes are occuprcd a. 
, ^bnum).S^.cond i uons™nbeu W se ta wHcb6.^co n ^on,a.c«ta 

.Ltl.OMsodiumion, «U» about 0.0Uo 1.0 M sedition concentration or otto 

a, pH 7.0 ,0 8.3 and the temperature is a, to, abou. 30°C for short probes (eg ,10 to 
5 0n«c 1 eo ti de S )and,,rco S .abo U . 6 0-Cforlo„gp ro be i (e.g.,^a.e I *an 5 0nuc 1 e„ h dcs). 

SBi ngen. conditions may a.so be achieved with <he Edition of dazing agents such as 
3„ fomtamide. For selccUvc or specific hybridiaadon, aposirive signal is a. leos. W o bmea 
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.. i,„ mm dhYbridization. Exemplary stringent 
Aground, preferably 10 limes Aground hybn ^ ^ ^ ]% ^ 

incubating at 42°C, or, 5x SSC, 1 A SDS, m 

62-C is typical, although tagh stnngeaoy ann « TypicaUy de eooditions for 

,o*ou,65.C,dep=ndingoothepn n >er,e ngUn »ndsp^Typ ' 

30soa. 2 ^.aoa M e* g phasela S UBg30ec- 2 r»^ 
reactions are provided, e.g., ui 

This occurs, e.g., when a copy of nude* ac, 
degeneracy permittedby the genettccode. Ins hens- 
m dermod=ra.ely srtngenthybridizahoncondhons. E^P'J 

guidelines for determining hybridization parameters are provxded m 
f Pmtocols in Molecular Biology, ed. Ausubel, at 

* .^duWeac.i^ofab^.cancerpro.cinincWestedc—onofap^cr 
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metastasis, and other characteristics of breast cancer ceils. Functional elfe 

ch anges inspectroscopic characteristics (e.g., fluorescence, absorbance, reftacv tndex), 
h ydmdynanuc(e. g ., shape), ohronrategraphic, or solubfltty properties for.hepro.em 
.Luring inducible martrers o, national activation of the breas. cancer proteu, 
Zl hind.gaofl.it.orbinding^s.e.g.binding.o — oro.herhgands^ 

^.olrcana.sobeperfonnedns.ngbreasroanoer assays anovm * moseof sh.,1 n*e 
art such . an ,n t»o assays, e.g, cefl gmwth ort soft agar; anchorage dependence; contact 

™* factor or serum defence; tenror specific marker teveU; invasrveness urto Manage., 
m growftr and merastasis in vivo; mRNA and protein expression in ccfls undergotng 

evaluated by tuauy means Itnowu to ftroso slcilled in the art, e.g., microscopy for quantitative 

identification of downstream or reporter gene expression (CAT, luoifenrse, p-gal, OH turd 
; ,„e m, e*. via ohcnriluminoscenoe, fluorescence, colorimcfiio reacfions, anfibody bmdnrg, 
inducible markers, and Ugand binding assays. 

••Inhibitors", "activators", and "modulators" of breast cancer polynuolcottdo 
m4 polypeptide sequences are used to refer to activating, inhibitory, or modubtting moteou.es 
or aompounds identified using i„ «. and in vfvo assays of breas. oanoer polynucleotide and 
0 peptide sequences. Inhibitors are compounds that. e.g„ bind ,0, partiafly or teteflybloolr 
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activity, decrease, prevent, delay ^ ^ense nucleic acids 

ac^or^ressiorrofbreastcancerprcterr, ^, ^ ^UvaW are 

eompounds that increase, open, or mooul ators also include 

np .guiate brcae, cancer protein actrvty. W*£« ^ ^ ^ as 

well as natnraUy oacnmng and synthette hgan , ^ ^ 

expressing dre breast cancer proten, ,» tA - . «*, desclibed 
mutator — d,3ndtbcnde.e™nin g d,= toU -U^ ^ 
above. Activators and inhibitors of breas, cancer can also betd^aj 

ot ! „, more breast cancer proteins, e.g„ 1, 2, 3, «. , . • ^ 
caneer proteins, auch as breaa, cancer protems encoded by the sequen 

0 ofapolypepUdersactaevedwhentheactrvrty polypeptide is 

Stably 50%, -note preferably 25-0%. Achvauo no £££'J^ „ , , 0% , 

more preferably 1 50%, more preferably 200-500 A (t.e„ two 

Wta.ionofgnawm.loaaofg^^rorsennn^K 
30 ability to form or suppress tumora when mjeoted mto aurtabt 
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Tecfcniflwe pp. 231-241 (3 ri e<L 1994). ... 

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers 
5 tospontaneousormducedphenotypicchangesthatdonotnecess^ 

new geneticmaterial. Although tartm^^^*^^"*^ 
vjand incorporation of new genomic DNA, or uptake of exogenous DNA, it can also anse 
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 
.0 aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney, 
Culture of Animal Cells a Manual of Basic Technique (3' d ed. 1994)). 

"Antibody" refers to a polypeptide comprising a framework regum from an 
hnmunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
1 5 epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable regum 
genes Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an anubody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 

20 Fundamental Immunology. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terrmnus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids pnmanly 
25 responsible for antigen recognition. The terms variable light chain (V L ) and variable heavy 
chain (V H ) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)' 2 . a 
dimer of Fab which itself is a light chain joined to V H -C H 1 by a disulfide bond. The F(ab)' 2 
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. vthe disulfide linkage in the hinge region, 
may be reduced under^ldcondUionstob^ ^ 

thereby —gtheFCah),^^ £^^^0-1-, 3d* 
chemically or by using recombinant mod ification of whole 

*^<»"» ^ ^..^^ Helena,, or po.yclona! 

producaonofsinglechainanabod.esCU.S.PatentVW, , sllch 

. other— , "^^"l-i^os andher.ro.onoF* fragments « 
, display technology oa„ be used .0 *** 554 
specifically bind lo selocted antigena (aee. e.g, McCafferty ««/.,« 

P»» Ma*a - W:779 - 78 ! (1 ! v 3 eoule „ wU ch (a) fheeonafan. 

A "chimeric antibody" is an anrtbody molecule m I 
rt . „ i, ^d, replaced or exchanged so «ha< Ore anugenbtndmg arte 

region, or a portion thereof,, a altereo, , . adi{fere „ toraltere< lctaa,efIeotor function 

^^"rrir^^^^-ior^-varfable 
different or altered antigen specificity. 
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ri-fgeness— usly aUows *e general of a gene expr^onp^.s 

l0 sampleTof breast and other tissues from surviving cancer patients. By companng expression 

are imports* (Wooing bo* up- and down-regniation of genes, in each of these states , 

obtained. . u . 

The identification of sequences that are differentially expressed m breast 

For example.aparticular treatment regime may be evaluated: does a chemofcerapeuUc drug 
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a partxcular 
patient Similarly, diagnosis and treatment outcomes may be done or confined by 
Imparmgpatientsampleswiththelmownexpressionprofiles. Metastatic tissue can also be 
20 analyzedtodeterminethestageofbreastcancerinthetissue. Furthermore, these gene 
expression profiles (or individual genes) allow screening of drug candidates with an eye to 
kicking or altering a particular expression profile; e.g., screening can be done for drugs 
that suppress the breast cancer expression profile. This may be done by making b.ocmps 
comprising setsofthe important breast cancer genes, which can then be used in these screens. 
25 These methods can also be done on the protein basis; that is, protein expression levels of the 
bre ast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents. 
!n addition, the breast cancer nucleic acid sequences can be administered for gene therapy 
purposes, including the administration of antisense nucleic acids, or me brea^^ 
(including antibodies and other modulators thereof) administered as therapeutic drugs. 
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., M nlirIe ; c ac id and protein sequences that 
ro « ere nMy«p« SS edta^'^ hmm,OT " e * tr.trula.taHi e expresseda, 
a higher level) in breast cancer, a* well as tm 4- 

h0 we,er,aswiU he applied hy^=»* evatotion; thU s, other breas, 

sequences are provtded <h» taals (incW i„g sheep, goal, pigs, cows, 

^.hsmslers.gurneap.gs.e.O.P^- ^ ^^orgatnsma 

horses, etc.) and pets, e.g., (dogs, eats, etc.). Breasr cancer q 

m »v be obtained using the techniques outlined helow. 

" BrJcanccrseqnenccacaninCndeboAnucleicactdandan^oaotd 

selected probes to the breast cancer sequences can be generated. 

AhreasicancersequencecanheinWaUyidentifiedhysubs^halnuCetcacd 

md/ „ rmi noa=idse,nencehon,ology.ou,ehre K .cancetse,nencesou,linedhe«in.Such 

homology can be based upon the overal. nucleic acid or amino acid sequence and ts 
^determined as outhned below, using either homology progmn, or hybndtzatton 

COndi,i0 ° S ' Poriden.ifyrngbreastcancer-associa^seqnenOehreas.cancerscreen 

te „e Outer suitable comparisons inclnde comparing breast cancer samp,e warn 
Static cancer apples from other cancers, such as lung, breast, gash— 

stat esand.is S uennder 8 oingme,as to is,areapphed,obiocmpscompns,ngnuc 1 e.c.od 
30 I! Thesampiesar.ars.miorodissec.ed.irapp.tcahle.andheatedastsltnownm.hear, 
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« nfmRNA Suitable biochips are commercially available, e.g. 

zzzzzi 

s „amountmoth^ 

that the target be disease specific, to minimize possible side effects. 

In a preferred embodiment, breast cancer sequences are those that are up- 
dated in breast cancer; that is, the expression of these genes is higher in the breast cancer 
tissueascomparedtonon-canceroustissue. ^-regulation" as used herem often means at 
least aboutatwo-fold change, preferably at least aboutathree fold change, with atleast about 
five-folder higher being preferred. All unigene cluster identification numbers and access.n 
numbers herein are for the GenBank sequence database and the sequences of the access.on 
numbers are hereby expressly incorporated by reference. GenBank is known in the art, see. 
e g. Benson, DA, et al, Nucleic Acids Research 26:1-7 (1998) and 
http //www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g., 
EuropeanMolecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
U S Patent Application N. 09/687,576, with the same assignee as the present apphcahon, 
further discloses related sequences, compositions, and methods of diagnosis and treatment of 
breast cancer is hereby expressly incorporated by reference. 

In another preferred embodiment, breast cancer sequences are those that are 
down-regulated in the breast cancer; that is, the expression of these genes is lower in breast 
cancer tissue as compared to non-cancerous tissue (see. , g .. Tables 1,2, 3, 15, 16 etc...> 
"Down-regulation" as used herein often means at least about a two-fold change, preferably at 
least about a three fold change, with at least about five-fold or higher being preferred. 
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can additionally provide high-resolution, high-sensitivity ^^^^^ ^b^Ttiticture, 
5 "rrrrrr^inn^ennhe 

^ activity correlation (« Anderson, MM.** «** 
,0 ^^paperpresented^BCP^^oe « 

,998), Subcellular logical information can also be utihzed tn abtologt al sensor 
If to pit «. bhely toxicologies! effect of chemical expose and l«te,y 
Zle Leshold s( aeeU.S.Pa«e»tNo. 5 ,S l U3.). Sintiiar advantages acorn, from 
rTrelevantroodterh— es and hioaotive agon, (o,„ nnoleic acids, sacchandes, 

teW.atl.as.onese.of^ayd^Th.datacnnUu.edtana.d^is^e^ 

inwhich datacanbemaintabsed and transmitted, he, is preferably - 
2 „ alase. The electronic daabase of the invention can he maintained on any eiectiontc 

„ is prefer disputed on a wide ate. network, snoh aa tire World Wd. Web. 

The focns of the present section on databases ma. inctade peptide senuence 

25 ^basesoanbeasaembMfor^yaasaydauact.nirednsinganaasayof.he.nven.ton 

Thooompositionaandmemodsforidentifymgand/or^t^gmetelattve 
^ocabsolnuabnndanceofavanetyofmolecmarandmaoromolccmarspeoiestioma 

^1 
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other, Although the data generated from the assays 

„ ■ th TfTIIpU U .S.Pa««n,s«,023,6 5 9 ^5,9«,7 12 d iS 0 1 o S eareU«.o O al 
known in the art. ror example, u.^ allows 

JL be catalogued and searched aceording to one or more protem fhnctron 
T r ™«5953 727d i so 1 oseaare 1 adonaUa.abasehav i „ g seone„eereeords 

< ~± ======= 

H -tic Patent 5 538 897 discloses a method 

< hPtween a kev sequence and a target sequence. U.S. Patent 

,0 described as on*e anaiyncad processing (OLAP), which entails the consohdatton of 
" " pI, 5 295,26. reports a hybrid database atmcbare U. which the H* of each database 

the merger of two or more such tree structures. 

See afso Moun, « »/.. POM* 

OeuL eds., 19, 8 »; Raahidi » Buehler, ^ — » " 
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Thep t e S e n .inven ti onp ro vi d6 saco m p«.etda te baseoon V „ S mgacompu K r 

w ift da BS p« i&tag .l« S <>u rce o ft he«e«-co»Uini„ B s mI p,e & o I „wh 1 cheachse< 1 ue„o. 

soecificity record was obtained. , 

toaaexemplary^bod^at^oaaoftaso^sof^^ 

sample is from a control *» sample known lo be free of pathologica! disorder, In a 
variation, a. least on. of <he sonrces is .known paOtologieal tisane specimen, e.g., a 
neop.aatiotesionoranotetisanospeoitnon^beanalyzedforbreas.o.noe, Inanotiter 

variation the assay record, cross-tab-late one or mora of Ota following parsers for each 
targe, species in a sample: (1) a nniqne identification code, which can inolnda, e.g., a target 
molecolar structnre and/or characteristic separation coordinate (e.g„ electrophoretic 
eoordinateO; (2) sample soorce; and (3) absolnte and/or relative rmandty of mo target apeo.es 

present in the sample. 

The invention also provides for the storage and retrieval of a collection of 
target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
m agnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage array, Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 
transistor gate states, such as an array of cells in a DRAM device (e. g ., each cell compnsed of 
a transistor and a charge storage area, which may be on the transistor). In one embodunent, 
the invention provides such storage devices, and computer systems built therewuh, 
comprising a bit pattern encoding a protein expression fingerprint record comprising umque 
identifiers for at least 10 target data records cross-tabulated with target source. 
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roftherelativean.ountofapepddeornucleicacidsequenceinapoolofsequences 

determined from a polypeptide or nucleic acid sample of a specunen. 

TheinventionalsopreferablyprovidesamagneticdislcsuchasanlBM- 

compatible(DOS, Windows, Wmdows95/98/2000, Wmdows NT, OS/2) or other format 

the invention inafile format suitable for retrieval and processing in a computenzed sequence 
analysis, comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computmg 
devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medmm, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the mventron. 

The invention also provides a method for transmitting assay data that mcludes 
generating an electronic signal on an electronic communications device, such as a modem, 
!SDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the srgnal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the inventus 
In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 
30 initialized to load and execute the computer program for alignment and/or comparison of the 
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Execution of .he computer program results m tire centia, p 
^medauae.wMeheompn.e.abina.ytie^tionofa.^y^ 

Ue target data or record and the compter program ^ 
M memo., which is typically random access memo, 

SGRAM, or SDRAM). Takers are ,ar*ed according ,0 ^ « t,, »nd tire 

Software Darwin); a data file can be en opticul 0, magnetic drA a data server, a memo. 
d CT ^g DRAM, SRAM, SGRAM, SDRAM, EPROM, HH. — * «•* « 

tha ,deseKb=d«bove,wUchcomprises ; (l)acompu,cr;( 2 ,»s,oredbi,pa..emeneod,nga 

» my be led in me computer; (3, a eomparison Urge,, sncb as a ,uery target; and (4) 
^ gl ,»r aiignmen, and comparison, typically wim Ordering of companson re- 
on the basis of computed similarity values. 

Characteristics ofbreast cancer-associated proteins 

Breaatcanoerprote^ofthepreaentinventionmaybeclasstfiedassecreud 

proteins.banamembranepmtemsormti.eUularproteins. mono embodhu^ abreast 
rusmand/or inthe nucleus. ht rao d ,u,srpm.einsareinvo,vedmati aspeo B of cetiu ar 
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h %ded 1994) For example, many intraceUular proteins 
have enzyme acuvity such as ^^^^ ^ ^ „lar proteins 

««. motifs for which defined functions have been attributed, in 

targets in a sequence dependent manner. PTB domains, which are disbnct from 
doI^,arnoh i nd C yroai„epho S phor,.a t edra Ig e,s.SH3dom^nahind,o P rohne.nch 

„ addUion, PH domahrs, tefratncopepbde repeats and WD domains to name 0* a 
5 raw have bean shown to mediate protein-protein interactions. Some of thesa may also be 
^vedmhindingmphosphotiptdaorothersecondmessengata. AswtUbeapprectatadby 
oneofordmaryskinmmaa^dnesamodfsannhaidandfledondaehasisorprn^ 

sequence- thus, an anaiysia of to sequence of proteins may provide instgh. mho both the 
Zmafropo^oftomo^-ormo^wimwhichflmpm^n^^ 
,0 OnenaeMdanataaoisKamfrro.emfimdUe.XwhichiaaiargecnUeodonofmatUp.e 
' ^.noeaUgnmentaandmddenMarho.mod.Ucnv.rmgmanycnmmon ptotem dommns. 

Center in England, and the Karolinska Institute in Sweden (see, e,, Bateman e, al, Nuc. 

25 al, Nuc. Adds to. 27:260-262 (1999); and Sonnhammer a, o/„ NucAadsRes. 26.320-322 

0958)> ' In tmother embodiment, the breast cancer sequences are transmembrane 
proteins. Transmembrane proteins am monies , hat span a phospholipid bilayer of a ce„. 
lymnyhaveanintracelh. ^don^^axfracnUntadoman.orbofrt. Themfr^U^a, 
3 „ Jainsofsnchpro t .in S ma,h.veanumberof to o,ions«rc to dmg«hnsoafr.adydeso„bed 
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,, the intracellular domain may have enzymatic aeSvity 
for inttaeellular protein, For example, the 

^^»^^^^ ? ^^^<*^ 
domamof^embranep^inaser^bo^ ^^^oryUnon 

conKta i„ g ^ ^ ^ fem „ „ .any - 

Qiich as G protein coupled receptors (CrfCKs; are u« 

• -h. Thmfore unon analysis of the ammo acid 
m av be followed by charged ammo acrds. Therefore, open an y ^ 

JLce of a particular prorein, the localization and number of ttansmemo.^ demons 

-P--^^^ u ^ eomtoof ^ taeprot ^ M ed i ve«;ho W ^, 

conserved are found repeatedly among various extract ^ ^ 
5 rLoand/or^onabavobo.^^entox^mo^^ 
^llulardomainsaieinvolvedmbindingtoothetmolecmea. In one aspect, extmcellul 

Uganda. wKohmaybepeprides, proteins,or — 

« Hndtottreircgnatereoeptoratoiniriamavarieryofcellulaxtesponsea. Outer 
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associated Uganda can be tethered to the cell, e.g., " 

Breast cancer proteins thai are bansmembrane are parttcularly pre 

WMypermeablirted.oprovideaocesstointraeellnlarpro.ems. 

nw „ 1 a 1S obeappree i a.edbymoseinnaear,.htUa«mbranepro.emoan 

Umadesomblebyremovinghanamem ^brane se,ne„eea, e.g., ^ngh .coombs, memods. 

Furthennore, transmembrane proteins ma. have been mad. soluble can be made to be 
15 secreted mrongh recombinant means by adding an appropriamsignmseqnenee 

In another cmbodhnen., the breast cancer proteins are secreted protetns; me 
season of which can be either consStuuve or regulated. These proteins have a argnal 
peptide or signal sequence that hugets the molecule to me secretory pathway. Secreted 
proteins arc involved in numerous physiological even*; by virtue of dtcir crculabng nature, 
20 Lyservetohar.mi.sig.aU.ovanouaomerccl.Wca.Thesecre.eapm.emmaytoeonm 

„„ aallsincloscproxnni^bamc cell tha, secreted m.fac«or)or an endocrme manner (achng 

aspects of physiology. Breast cancer protems ma, are secret protema are parhcularly 
25 pmferredin.hepresen,mvenuonas t heyse re ea,good,arge K fordiagnos«,cm*ers,,g., 

for blood, plasma, serum, or stool tests. 

Use of breast cancer nucleic acids 

As described above, breast cancer sequence is initially identifiedby 

30 substanualnucle^^ 
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thesamemolecule. of the invention, e.g., to sequences 

The breast cancer nucleic acid sequences oi me ui 
i„ Tables .-25 can be fragments oflarger Benes, U, they are nuoleie aoid segments 

Ilcodingtegiona. Accurdingly. a, -bo appreciated by those in, ho at,, usingth. 

te full long* sequence, see Ausubel, , * su P ra. Much can be done * — - 
maIiy sequences canbeclus,eren«o include ntui^e fences correaponding» a single 
gene, o.g„ system, such . UniQono (see. h« p .y/ww.nobi.nhn.nih.gov/UntGenef,. 

Once the breas, oancor nucleic acid ia identified, i, can be cloned and, tf 
necessary, its consttiuen, parts recombined ,o fortn the entire breas, cancer nucleic acd 
coding regions or tine entire tnRNA sequence. Once isolated from Us natural source, 
conned within a plaatnid or other vector excised .herefrom as a linear nucleic ac.d 
^ent.titereoombinan.hreaatcance.nud.ioaeidcanbemrther-usedasaprobeotdenniy 

and isolate o,her breas, cancer nucleic acids, e.g., extended coding regions. 1, can also be 
used as a "precusof ■ nucleic aoid to make modified or variant breast cancer nucletc actds 

and proteins. . , 

The breast cancer nucleic acids of the present invention are used in several 
ways inafnstembodiment.nucleicacidprobestothebreastcancernucleicacidsarernade 

and attached to biochips to be used in screening and diagnostic nrethods, as outlined below, 
or for administration, e.g., for gene therapy, vaccine, and/or antisense applicaUons. 
Alternatively, the breast cancer nucleic acids that include coding regions of breast cancer 
proteins can be put into expression vectors for the expression of breast cancer proteins, agam 
for screening purposes or for administration to a patient. 

b a preferred embodiment, nucleic acid probes to breast cancer nucle.c adds 

30 .othmenucleicacidse^ 
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nad . The nuCeic - ptnbes attache, ,0 the biochip - designed to be — V 
L pta nenU* to *e breast cancer -etc a* ,, the targe, ae^uene, (e— set 

itadt*-.*-*— ^n^no.r.periect^ma^any^rofb^ 

air— .^«m^^*^^ toW r£^ 
^suandeonucteicacidaofbepresentinveouo, However, ifthenumbe o n, ta, ona 

^^■ha.nohvbrid^o.^ocour^evenOte.^a^ofhybnd^ 

complenW herein is menn. to. the probes are sufficiently comp.emen.ary to me Urge, 
sequences to hybridize under normai reaction conditions, particularly high stnngency 

conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partmlly smgle 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence rs used, with 
either overlapping probes or probes to different sections of the target being used. That is, 
two three, four or more probes, with three being Referred, are used to build in a redundancy, 
for a particular target. The probes can be overlapping (i.e., have some sequence in common), 
or separate. In some cases, PGR primers may beused to amplify signal for higher sensibly. 

As will be appreciated by those in the art, nucleic acids can be attached or 
iram obilizedtoasolidsupportinawidevarietyofways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostaUc, 
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• included in non-covalent binding is the covalent 

• A, described herein, the nucleic acids can either be 

^^-d I.— I Corona— AswiUbe^^c 
glass and mediae or finalized gJass, plasto (ino.uding acrylic., po.ystyre.re and 

po^Wes.ny^norni— s, resins,, .,ca or s.Uca- 
IZ nra.er.aU inCuding sfflcon and modified silioon, carbon, nre.aU, inorgamo gUsses, 
pW ics etc. In genera., fire snbstta.es a.low optical detection and do no, appreuab.y 

25 ir^lficB.ooMp.U.S.AppUcnfionS^No.O^U.fi.edMarcbU,^, 

herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 
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cached to -aces coding aani.ro groups, eg. using .intern as arc - 

^rhc^-btfUnctJo...^-"-^-^ 1 ^^"™ 
catalog lechnica. section on cross-linkcrs, pages 155-200). In addition, in some cases, 

to to embodiment, oligonucleotides are synthesized as ia known in the art, 
^^anaehod.onneaurfaceofti.esohdsupporfAswiUbeappreei^hy.hoseskiUed 

to the art, either the 5' or 3' terminus may be attached to the solid support, or attachmen. may 

be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong yetnou-covalent For example, biotinylated oligonucleotides can be made, wmch 
bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oUgonucleotides may be synthesized on the surface, as is 
known in the art.For example, photoactivation techniques utilizing photopolymerizatton 
pounds and techniques are used. In a preferred embodiment, the nucleic acids can be 

inWO 95/25 1 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, ampUfication-based assays are performed to measure the express™ 
tovd of breast cancer-associated sequences. These assays are typically performed in 
conjunction with reverse transcription. In such assays, abreast cancer-associated nucleus acxd 
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PCR). In a quanutative amplification, me comparison to appropriate 

p^-^^o^.^;— Methodsof 

controls provides a measure of the amount ^ . ^ ^ Detailed protocols for 

, ampliation are „el, Imown to *- * « ^ „ ^ „„, 

qoantitttive PCR are provided, e.g„ ur touts « «/., KS 

♦ i c thf 5' nuclease activity of the polymerase, e.g., Ampul aq, 
rSSS^ T—separatesme, 

1S M p !if «on< 5 <«,e.g,u«^ 

a • , it ,„ t LCW(seeWu&Wallaee,Oe»omics4:560(19 8 9),Landegren««l.,Se,^ 
cha,nrer>ctron(LCR)(seeWu Option amplification 

•>jl-ltm(1988),andBanmgereto/.,GeiieliV.li'l.i«o;„ r 
241.1077(1985), , m8o . U 73 (1989)), setf-snstaineri sequence repltoattoo 

20 n:r;o"^ 

etc. 

and recombinant DNA technology ^ 

30 m^ T -— <— - 
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refeK „ ona ^ -a. ^ ^ for , g ., taclode . 

secretory leader is operably tanked to DNA for a po yp p 

to a coding sequence if it affects the transcnpt.cn of the sequence, 
is operably linked to a coding sequence if it is positioned so as to facilitate ' IanS 
GoteraUy "operably linked" means that the DNA sequences being linked are contiguous, 
" case of . secretory Under, contiguous and in reading phase. However, enhancers 

soshicrion si.es. If such sites do not axis, sy.the.ic ohgonucleoude adaplors 0. hnkera are 

nucleic acid will generally he appropritde ,o dte host call used .0 express the breast cancer 
protein. Numerous types of appropriate expression vectors, and suitable regulatory 
sequences are known in the art for a variety of host cells. 

In general, transcriptional and tntnslational regulator, sequences may tnclude, 
b„, am no. limited to, pro.no.er sequences, rihoaoma! binding si.es, wuscripnona. sUrt and 

stop sequences, — *>< ** ^ 01 

Jprafened embodiment .he mg-ory sequences indude a promoter and hanscttphonal 



25 start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybnd 
promoters, which combine elements of more than one promoter, are also known m the art, 
and are useful in the present invention. 
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vectors, the expressioa vector coma expression construct. 

appropriatehomologoussequenoeformclus.onmthevector. c 

i.„«i»«».h*.«t(*F— *-*»«*-^* 

,„ addition, in a preferred embodim en., One express vector — . 
stable mamer S ene » a..o„ tbe sCecrion of tinned *- * ~ — " 
W e U knowninmeartandwfflvaryvrimmehostcellused. a hv culturin. a 

Th e b^oan M pro«imof(h e p»e D tto=»tiona re produc e dby«toga 

Hostceutmnsf. bnnedwithan expression v^ — nuc^acid »co* g a rea. 

canoes pro* Conditions appropriate for breast cancer protein expression «*,ary «Mt 

in the ar, through routine expen.nen.adon or optimization. For example, rhe use of 
c^Bavepmmo^mthe^sionvecWrwin^optumzm^gm^and 

MaddiUon-insomeembodim en* me timing— 

viruses tmdtbusharvesttimeseleetiouoanbeoruoialforproduotyteld. 

Appropriate host cells inchrde yeas,, bacteria, arcbaebacteria, fun*, and msec, 

BHK, CHO, COS, HeLa ceUs, HUVEC Ourmannmbffloa, vem en^ba! 
ceils) THPleellXamacropbage ceil lin.)and various other human ceUs and cell hues. 

in a preferred embodiment, the breaa, cancer proteins arc expressed m 
3 ^abanceUs. Mammaiian expression systems are aiso known in mo ari, and include 
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< andhaveabroadhostrange. Examples loclude the SV40 early prom 

LTR premeler, — major ,.e promeler, herpes simple, ~ prom«r, 

,0 Bxamplesof^on^or^polyademyahon,^, 

include those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammahan hosts, as 

well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitin, 
15 polybrene mediated transfection, protoplast fusion, electroporation, vxral mfecUon 

encapsulation of the polynucleotide^) in liposomes, and direct microinjection of the DNA 

into nuclei. . . - 

In a preferred embodiment, breast cancer proteins are expressed in bactenal 
systems. Bacterial expression systems are well known in the art. Promoters from 
20 bacteriophagemayalsobeusedandareknowninmeart. In addition, synthetic promoters 
andhybrid promoters are also useful; eg., the tac promoter is ahybrid of the trp and lac 
"moterse^ences^ ' 
'tersofnon-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient nbosome 
25 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the breast cancer protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bactenal 
expression vector may also include a selectable marker gene to allow for the selection of 
30 bacterial strains that have been transformed. Suitable selection genes include genes which 
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to art s^haa calcium chloride treatment, electmporauon, and othas. 

Moneembodimeat.oreartcancerpro^areprodueedinmsecceua. 

exoression vectors, are well known in the art. 

P toapreferredamb^bre^c^pro^ispre^my-'^ 
Yearte^oasyatemaar.weU^mmeart.andmeWe^e.^^ 

15 B^«r^fia^^l^.^^^^^ P - p ° Mm - 
Schizosaccharomyces pombe, and itorowia lijiolyrtar. 

The breast cancer protein may alao be made as a Won protean, uamg 

desired epitope is amaU, the breaat cancer protein may be fiaaed .o a earner P»tem «o f rm .» 
20 immnnogen. Aitemafiveiy, me breaker protein may be made aa a fiaaron protern, 

bl eaa<oancer peptide, .be nuc.eic acid encoding the peptide may be indeed to other nnolerc 

acid for expression purposes. 

,„ , prefer embodiment, the breast oanoer protein ,a pnnfied or rsolated 
25 after eapreasion. B re as,o«ncerpro.e i nam»ybeiso 1 a, 0l orpnrifted i n.vane«yofw.ya 
ta0 »n to thoae shUled in the art depending on what other components are present m the 
sarcple Standard purification methods include eiectrophoretic, mdaoular, immunologroal 

30 l^pufifiedusmgashmdardanfi-breartcancerprntemantibodycninnm. 
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• tinn with orotein concentration, are also useful. For 
and diafUtration techniques, » conjunctton wtth protet 

r i- fj^ c Thevmavbeusedasimmunoselection 
nucleic acids are useful in a number of apphcations. They may 

reagents, as vaccine reagents, as screening agents, etc. 

Variants of breast cancer proteins . ^ 

del e„on or insertion, with amino acid suctions being particutarty preferred. Tho ammo 
^^on.^onordel^mayocour^y residue within .he bmasrcaneer 

" PeP '"' e ' ^oinolud.dwitoonee^Uoen.ofbreas.oanoerpro.etaoftheprosoo. 
invention are amino acid sequence variants. Tfiese variants typicafiy H in,o one or more of 
musses: substitutional insertions! or deWonat vanan K . These variants ordmanly are 
p^bysitespeoiacmutag^sof^desmareDNA^gtob^canccr 

20 pm tt in, U sin g ca K e..eorPCRm«Ugene S isoro*er,eobm,ues»eU to own taft e*.o 
U c ;DNAenoodin g .hevarian t , a ndn,e re a ft erexprossua gto DN Am r«comb,na„.e« 

cnta n as oufihteri above. However, variant br^canoerpmtein fragments M -0 
M ,00-150 residues may bo pmpamd by in vitro synthesis using estabUshed teehnrques. 
^aeid sequence vaHan B »e character by Unopr^etenninednab^ of 0aovana»n, 

25 a fealure dta. sets them apart from naburaUy occurring allelic or inlerspecres vanatron of 
teas, cancer protein amino acid sequence. The variants ryprcnUy exhibi. the same 
a**, biological active as the nahrraUy occurring analogue, although vanants can a.so 
be selected which have modified characteristics as will be more firlly outlined below. 

While ure site or region for inhoducing an. amino acid sequence varmtron ,s 

30 p^detemnneri.firemuutionpers.neerino.bepmdctenninen. For example, in order to 
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u , f from about 1 to 20 amino acids, although considerably larger 

relationships provided in the definition section. 

ThovananttWic-ly^Wtntcsamc^vebiologi^acn^^wnl 

fcc variant may he designed snehtttatdtohioiogtoai aetfvi V of fnehrens. oancerprotetn ,s 
altered. For example, glycosylate sites may be altered or removed. 

Substantial changes in function or tanunologicttl identity are made hy 
^suhan^ons.ha.aretac^v.^^d^ahov.For^ 

lone m mela of the alteration, for example the a,ph,he,ical or hetn-ahee, shnc«e ; 

T» anhstitntiot* which in genera, are expect to produce me greatest changes m m. 
poiypeptid* properties arethosein which(a) a hydrophilic residue, e.g. aeryiornareonyl ,s 
2L for (or hy, a hydrophohio residue, e.g. leucyl, isofeucyl, v* or 
3 alanyl; 0» a cysteine or proline is substituted for (or by) an, other restdue; (c) a rostdoe 
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• ,i nr histidvl is substituted for (or by) 

rwtvne of covalent modification uicnifles re*, e 

» f,h,S mSnM ° ^ *!, Me ^ m organic derive agen, that is capable 
acid residues ofa breast cMoaipolypepUda^th an g 

fUUvdescribedbelow. Commonly used crossbnking agents mc.ude.e.g., U 

xuuy ae^nu iwdroxvsuccinimide esters, e.g., esters 

^l33^ob« S »eoinirnidy^iona,e,, Wfte .oao M -*-<-* 

ZiZlu— ■ ana agents «* as menry ,3^dopne„»rop— . 

O^rnodifieadonaWudedeamida.ronofg.n^ylandaaparagmy 
didoes tothecorreapondrnggmnnny.andaspa^y.residnes, respeetivdy, hydroxy.anon o 

terminal amine, and amidarion of any C-terminal carboxyl gronp. 

An^^ofcovalentmr^fic^onofaKb^^POlyPapM' 
^ndedwithinUneseopeofUnsinvennon eomprisesatrerrng the native glycosytaUonpanem 

"1 Hypepbde, and/or addntg one or more glycoaylation sites ft* are no. present rn me 
X JJebreaateanoorpoiypeptide. OlyooayUtionpanems o„bea«ered nrmany 

U. «* example theuseof different -W»-»-t-*— — * 
fences can result in different glycosylate pattema. 
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Addition of glycosylate sites to breast cancer polypeptides may also be 

r Wb^^lgtianunoacids^ence thereof, 
accomplished by altermg toe tQ ^ native 

by the addition of, or substitution by, one or more senneo 

, rfnr n linked glvcosylation sites). The breast canoci 

sequence breast cancer polypeptide (for O-UnKect giyc y 

10 polypeptide Such methods are described u. the art, e.g, m " 
Wriston, CRC Crtt. Rev. Biochem., pp. 259-306 (1981). 

Remova, of carbohydra.0 moMes present on die hreaat cancer polypepbde 
^.o^pUshedeheaueaUy.re^^yorbym^anhatiPitto.ofoodom 

ene^gforareu^aetdreaidue.thataen-eas^forg.yeosylation. Chemical 

« „l, ^re*. Bmcto". a****. 259:52 (1987) and by noge e, Anal. B~d.e... - - «. 
(1981) Enzymatic eleavage of oarbohydrate moieties on polypeptides oan be actaevedby the 
nseof a variety of endo-and exo-glycosidaaea as described by Thotakure e< „/., M 

Bizwno/., 138:350(1987). 
20 Another type of covalent modification of breast oanoer compnsea finking the 

breastoanoerpo!ypeptidetoon«ofa»arietyofnonproteinaceonspolvmera,e.g., 

US.Parea.Nos. 4,640,835; 4,496,689; V01,m 4,670,417; 4,791.192 or 4,179,337. 

Breaat oanoer polypeptides of the present invention may alao be modtfied m a 

25 way to form chimeric moleou)ea compriaing a breast oanoer polypeptide ftised to another, 
„e.ero.ogoos polypeptide or amino add sequence. In one embodiment, soch a clnmenc 
m olecnle comprises . fission of ahreas, oanoer po.ypep.ide win, a tag po.ypophde whtoh 
provides an epitope to whioh an anti-tag antibody oan selectively bind. The epitope tag ts 
generally placed a. the amino-or oaaboxy.-termi.ns of fine breaa, cancer po lyP epnde. The 
30 Lenoeo t a»ohepi«ope- K gge4fo m sofabreas.oancorpo.ypa P tidecanbode.« tt dna,ng 
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an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast 
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the 
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
fflS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et 
al., Mol. Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan et al, Molecular and Cellular Biology 5:3610-3616 (1985)); 
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al., 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide 
(Hopp et al, BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al., 
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al., J. Biol. Chem. 
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al., 
Proc. Natl. Acad. Sci. USA 87:6393-6397 (1990)). 

Also included are other breast cancer proteins of the breast cancer family, and 
breast cancer proteins from other organisms, which are cloned and expressed as outlined 
, below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 
used to find other related breast cancer proteins from humans or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include the unique areas of the breast cancer nucleic acid sequence. As is generally known in 
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 
about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for 
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to 
generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein 
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^—-i— ^.-«— «**"»+rr b> t ^zz 

mad e.o aS -e,brea S ,c a ncerp ro .einwiUbea b .. to b Wt oU,e M He„ 6 U,p^n, 

particularly tinear epitope, In a preferred embodiment, •» epitope is tuu,u.-, tba, .a, 
anab-HiieageneratedtoauniqueepitopeahowUtrteernocross-reactrnty. 

Medrodsofprepanngpo.yclo^anrtbodi.sareknown.o.bea.aUertartrs.rt 

(. fc O-^-^-iB*-*'-*«** Po.yclona.andbodieacaaberaUed.na 
LL* e.g. by one or more injections of an immunizing a g en, and, 
Typically, .he immunizing agent and/or adjnvan. win be injected in the mamma! by muHtpie 
aubcutaneous or intraperitoneal injections. The immnnbdng agen. may todude a protiem 
encoded by a nooleio acid of .he fignrea or fragment thereof or a fusion protein thereof. It 
may be useful .0 conjugate the inununizmg agent to a protein known to be tamunogenrc m 
the mammal being immunized. Examples of snob immunogenic ptoteins taclude but are not 
limited to keyhole limpet hemocyanta, aerum albumin, bovine thytoglobutin, and aoybean 
ttypain inhibitor. Examples of adjuvants which may be employed include Freund's oomp.ete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomyoolate). The immunization protocol may be selected by one skilled in the art 

without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
antibodies may be prepared using hybridoma methods, such as those described by Kohler & 
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bmd to 
the immunizing agent. Alternatively, the lymphocytes may be immunized m vitro. The 
in^unizing agent will typically include a polypeptide encoded by a nucleic add of Tables 1- 
25 or fragment thereof, or a fusionprotein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 



PCT/US02/02242 

WO 02/059377 



pp.59 103(198 » boviliean dbumanorigin. Usually, rat or mouse 

prfrcularly myeloma calls of rodent, bovr ^ 

theuniusea,uim /tj^ppt or HPRT) the culture medium 

hypoxanthine guanine phosphoribosyl transferase (HOPRT or HPRT), . 

maaiiumawhiahaubs^oespraventmagrowmofHGPRT-defioien.oalls. 

to one embodiment, toe antibodies are bispecifio antibodtes. Brspectfic 
Mli bod,es are monoctona., preferably buman or humanized, antibodies that have btodtog 

« (topesontoesame^tigen. too„eembodi ra en«,oneo t toebtodin g spectae,,,e S , a 
plLncndedbyanucleicacidTables 1-25 or a fragment thereof, the other one ,s for any 
otoer antigen, and preferably for a ceU-surfece protein or receptor or receptor subunrt, 
preferably one fta. is tumor specific. Alternatively, tetramer-type technology may create 

multivalent reagents. m 

to a prefaared embodimenube antibodies to breast canoer protetn are capable 

of reducing or efiminating a biologiea! fimctton of a breast cancer protein, as is desenbed 
below Tbat is, toe addition of anti-breaa. cancer protein antibodtes (eitirer polyclonal or 
preferably monoolona!) to breast cancer tissue (or cells enntoining breast cancer) may reduce 

toe tike is preferred, with a. leas, about 50% being particularly preferred and about a 95- 

100% decrease being especially preferred. 

to a preferred embodiment toe antibodies to the breast cancer protems are 
humanized antibodies (e.g., Xenerex Biosciences, Mederex, too., Ahgenix, Inc., Protem 
Design Ubs,lnc.) Humanized forms of non-human (e.g„ murine) antibodies are ohrmenc 
m „,ecul=s of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
FabF»b'F(ab')2orotoeran,igen-biudmgs»bser 1 »encesofantibodies)whiohcon,am 

, ruimmalacuenoederivcdfiomnon-humanlmmunoglobulto. Humanized anfiborfios molude 
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human species (donor antibody, such as »« * or rabbtf hav,n 1 

- r. „» instances Fv framework residues of the human 
affinity and capacity, fn some instances, rv tr nmiajilx i antibodies 

MIM are replaced by oorresponding non-humtur residues. Humanized an 
may also comprise reaiduea which are found neither in me recipient anhbody „ m me 
Tporred CDR or framowora In genera., a humaruzed anfibody wt compnse 

^anfimfya.lofa.feaatone.and.ypicnny^variabfodonuina.inwmchaffor 
^aflya.lofmeCDR.gionaco^ml^^ofano.hnn^mrnmnog.obuhn 

andaltorsubsmtfallyaUofmenamework^regionaarethoseofahuman 

„ Th, humanized antibody optimally also will compnse 
immunoglobulin consensus sequence. The humanizeo annuo y p 

* teas, a portion of an imm«nog.obulin constant region (Fo), typicaUy ma, of a human 
immunog.ohu.rn (Jones a, a,., Nmre 321 1522-525 (.986); Rieohmann a, al,m,ure 
332-323-329 (1988); andPreafa, Cur. Op. Smc,. Bid 2:593-596 (1992)). Humaruzatton 
cffl be eaaendaUy performed Mowing the method of Whiter and co-worker. (Jones a< of, 
Nam* 32H522-525 (1986); Riechmann e, al, »maa 332;323-327 (.988); Verhocyen at 
ScW 239:1534-1536 (1988)), by substituting roden, CDRs or CDR sequences for the 
corresponding sequences of a human antibody. According, such humanized antibodies are 
chimeric antibodies (U.S. PatentNo. 4,816,567), wherein substantially less than an ,mao. 
human variable domain has been substituted by ttte corresponding seqnenoe ta a non- 

human species. . 

Human antibodies can also be produced using various techniques known in the 

art including phage display libraries (Hoogenboom & Winter, /. Mol. Biol. 227:381 (1991); 

Marks et al, J. Mol. Biol. 222:581 (1991)). The techniques of Cole et al and Boerner et al. 

are also available for the preparation ofhuman monoclonal antibodies (Cole et al, 

Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al.,J. Immunol 

147(l)-86-95 (1991)). Similarly, human antibodies can be made by introducing of human 

^unoglobulin loci into transgenic animals, e.g., mice in which the endogenous 

i^unoglobulin genes have been partially or completely inactivated. Upon challenge, 

3 huffl an antibody production is observed, which closely resembles that seen in humans m all 
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, „ ,. «s-8« 859 (1994); Morrison, Nature 368.812 -11 
783 (1992): Lonberg elal., Nature 368.856-859 (W), 

(1994); Fishwild et at, Btorectaotogy 14.845-51 (1996) N g 

14:826 (1996); I^nberg * Huszar, M» *— - > 
By hnntunotherapy is n,- treatment of breast cancer mth an anUbody ratsed 

ag.nstbreas.cancerprotein, As used herein, unmunotherapy 

LpJ. (panen., hrduction of an immune response is (he reanl. of providing (he recip 
with an airtigen (o which antibodies are raised. As appreciated by one of ordinary skill in the 
art the antigen may be provided by injecting a polypeptide against which anttbodies are 
desired to be raised into a recipient or oonracnng the recipient with a nucleic acid capabte of 
exposing Ure antigen and under conditions for expression of the antigen, leaning to an 

immune response. *:u rtf i^ c 
In a preferred embodiment the breast cancer proteins agamst winch antrbod.es 

are raised are secreted proteins as described above. Without being bound by theory, 
antibodies used for treatment, bind and prevent the secreted protein from binding to rts 
receptor, thereby inactivating the secreted breast cancer protein. 

In another preferred embodiment, the breast cancer protein to which 

antibodies ^.^^^^"^ 
used for treatment, bind the extracellular domain of the breast cancer protein and prevent it 
from binding to other proteins, such as circulating ligands or cell-associated molecule, The 
5 antibodymaycausedow.-regul^ 

appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domam of the 
breast cancer protein. The antibody is also an antagonist ofthe breast cancer protem. 
Furthers antibody prevents activation of me transmembrane breast cancer protein. In one 
30 aspect, when the antibody prevents the binding of other molecules to the breast cancer 
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♦ of the cell The antibody may also be used to target or 

protein, the antibody prevents growth of the cell, ine y { m 

ensitize the cell to cytotoxic agents, including, but not Unnted to Tr^-a, IL 1 ^ Y 

Il^orchernotherapeuticagents^ 

activates serum— 

mediating cytotoxicity or antigen-dependent cytotoxtctty (ADCC). Thus, bre 

LerpLin. AnW^^ 

otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector 

such ldioactivel^^ 

pro tem. Jt^m**"^^"^*^*"*^^ 

may inhibit enzymatic activity such as protease or collagenase or protein kinase actmty 

associated with breast cancer. , 
In a preferred embodiment, the therapeutic moiety can also be a cytotox,c 

creancer. Cytotoxic agents are nnmeroos and varied and inolndo,bn. are no. muted to, 
cytotoxicdnagsortoxinsoraouvetragmentsofsnchtoxins. Soitahie toxins and men 

chain, curch,, crotin, phenomycn, enomycin and the Ute. Cytotoxio agents also melnde 

pmteins, orbinding of a radionuclide to a cheMng agent tira, has been covalentiyahached o 
1. antibody. Targeting me therapeutic moiety to transmembrane breast cancer protems not 
01u y serves .0 increase the focal annexation of therapeutic moiety in tire breast cancer 
JLed area, but also serves to reduce deleterious aide effects that may be associated wtth 



30 the therapeutic moiety 
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In another preferred embodiment, the breast cancer protein against which the 
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
5 the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a 
cell, i.e., the nucleus, an antibody thereto contains a signal for that target localization, i.e., a 
nuclear localization signal. 

The breast cancer antibodies of the invention specifically bind to breast cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
10 Kd of at least about 0.1 mM, more usually at least about 1 \xM, preferably at least about 0.1 
liM or better, and most preferably, 0.01 \M or better. Selectivity of binding is also 
important. 

Detection of breast cancer sequence for diagnostic and therapeutic applications 

15 In one aspect, the RNA expression levels of genes are determined for different 

cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue 
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying 
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 

20 essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 

25 obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or cancerous tissue. This will provide for 
molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 

30 patterns within and among cells and tissue. Thus, a differentially expressed gene can 
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qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
5 detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 

1 0 standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 

15 about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 

20 final gene product itself (protein) can be monitored, e.g. , with antibodies to the breast cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast 
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be 
evaluated in a breast cancer diagnostic test 

25 In a preferred embodiment, gene expression monitoring is performed 

simultaneously on a number of genes. Multiple protein expression monitoring can be 
performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
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a particular cell. The assays are further described below in the example. PCR techniques can 
be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the breast cancer protein are 
detected. Although DNA or RNA encoding the breast cancer protein may be detected, of 
5 particular interest are methods wherein an mRNA encoding a breast cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 

1 0 examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 

1 5 washing to remove the non-specifically bound probe, the label is detected. For example a 
digoxygenin labeled ribopirobe (RNA probe) that is complementary to the mRNA encoding a 
breast cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

20 In a preferred embodiment, various proteins from the three classes of proteins 

as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing breast cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 

25 expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, breast cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer. 
30 Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis 
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of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the breast cancer protein is 
5 detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein. 
Methods of immunob lotting are well known to those of ordinary skill in the art. 



in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 

10 to many antibodies to the breast cancer protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the breast cancer protein(s) 
contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another 

15 preferred embodiment each one of multiple primary antibodies contains a distinct and 

detectable label. This method finds particular use in simultaneous screening for a plurality of 
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are also provided by the invention. 



20 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 



cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of breast cancer proteins. 
25 Antibodies can be used to detect a breast cancer protein by previously described 
immunoassay techniques including ELISA, immunoblotting (western blotting), 
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous breast cancer protein. 



30 nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 



In another preferred method, antibodies to the breast cancer protein find use in 



In a preferred embodiment the label is detected in a fluorometer which has the 



In another preferred embodiment, antibodies find use in diagnosing breast 



In a preferred embodiment, in situ hybridization of labeled breast cancer 
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breast cancer tissue and/or normal tissue, are made. /// situ hybridization {see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 
findings. It is further understood that the genes which indicate the diagnosis may differ from 
those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing breast cancer sequences are used in prognosis 
assays. As above, gene expression profiles can be generated that correlate to breast cancer, 
in terms of long term prognosis. Again, this may be done on either a protein or gene level, 
with the use of genes being preferred. As above, breast cancer probes may be attached to 
biochips for the detection and quantification of breast cancer sequences in a tissue or patient. 
The assays proceed as outlined above for diagnosis. PCR method may provide more 
sensitive and accurate quantification. 

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and 
antibodies as described herein are used in drug screening assays. The breast cancer proteins, 
antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are 
used in drug screening assays or by evaluating the effect of drug candidates on a "gene 
expression profile" or expression profile of polypeptides. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
agent (e.g., Zlokarnik, et aL, Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified breast cancer proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the breast cancer phenotype or an identified physiological 
function of a breast cancer protein. As above, this can be done on an individual gene level or 
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by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokamik, supra. 
5 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in breast cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
breast cancer protein. "Modulation" thus includes both an increase and a decrease in gene 

10 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing breast cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in breast 

1 5 cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in 
expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard 

20 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.. 
25 In this embodiment, the breast cancer nucleic acid probes are attached to 

biochips as outlined herein for the detection and quantification of breast cancer sequences in 
a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, 
may be used with dispensed primers in desired wells. A PCR reaction can then be performed 
and analyzed for each well. 
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Expression monitoring can be performed to identify compounds that modify 
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide 
sequence set out inTable 17. Generally, in a preferred embodiment, a test modulator is added 
to the cells prior to analysis. Moreover, screens are also provided to identify agents that 
5 modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or 
interfere with the binding of a breast cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 

10 indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a breast cancer phenotype, e.g. to a normal tissue 
fingerprint. In another embodiment, a modulator induced a breast cancer phenotype. 

15 Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

Drug candidates encompass numerous chemical classes, though typically they 
20 are organic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than 
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise 
functional groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 
25 preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. Candidate agents are 
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred 
30 are peptides. 
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In one aspect, a modulator will neutralize the effect of a breast cancer protein. 



By "neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell. 



5 screened for an ability to bind to a breast cancer polypeptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating- the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 
10 are employed for such an analysis. 



providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
15 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 



compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 

20 chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop etal, J. Med. Chenu 37(9):1233-1251 (1994)). 

25 Preparation and screening of combinatorial "chemical libraries is well known to 

those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka, Pept Prot. Res. 37:487-493 
(1991), Houghton et ai, Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 

30 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,5 14), diversomers such as 



In certain embodiments, combinatorial libraries of potential modulators will be 



In one preferred embodiment, high throughput screening methods involve 



A combinatorial chemical library is a collection of diverse chemical 
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hydantoins, benzodiazepines and dipeptides (Hobbs et al, Proc. Nat. Acad. Sci. USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al, J. Amer. Chem. Soc. 
114:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann et al, J. Amer. Chem. Soc. 1 14:9217-9218 (1992)), analogous organic syntheses 
5 of small compound libraries (Chen et al, J. Amer. Chem. Soc. 116:2661 (1994)), 

oligocarbamates (Cho, et al, Science 261 : 1303 (1993)), and/or peptidyl phosphonates 
(Campbell et al, J. Org. Chem. 59:658 (1994)). See, generally, Gordon et al, J. Med. Chem. 
37:1385 (1994), nucleic acid libraries (see, e.g., Strategene, Corp.), peptide nucleic acid 
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn^ al, Nature 

10 Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries (see, 
e.g., Liang etal, Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. , 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 

15 U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). 



available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 



solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD, (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif), which mimic the manual 
25 synthetic operations performed by a chemist. Any of the above devices are suitable for use 
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 



Devices for the preparation of combinatorial libraries are commercially 



20 



A number of well known robotic systems have also been developed for 
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MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription, 
5 inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 

10 e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
US. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 

15 (see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 

20 as a high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 

25 proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
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preferred. Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
5 to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 

10 may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 

15 preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of 

20 nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to 
purines, etc. 

Modulators of breast cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
25 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 
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After the candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
5 with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemilumine^cent, a chemical, or a radioactive signal, to provide a means of 

10 detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 
such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 

15 epitope tag or biotin which specifically binds to streptavidin. For the example of biotin. the 
streptavidin is labeled as described above, thereby, providing a detectable signal for the 
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 

20 probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

5.594.117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 

5.594.118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 

25 conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target Stringency can be controlled by 
30 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
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temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
5 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
10 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
15 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the breast cancer phenotype. 
In one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e.g., for diagnostic applications, having identified differentially 
20 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
25 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress a breast 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
breast cancer gene expression profile so as to mimic the expression of the gene from normal 
30 tissue, a screen as described above can be performed to identify genes that are specifically 
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modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or 
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences 
can be identified and used by methods described herein for breast cancer genes or proteins. 
5 In particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated breast cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
breast cancer cells, that have an associated breast cancer expression profile. By 

10 "administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 

1 5 the peptide agent is accomplished, e.g., PCT US97/01 019. Regulable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
20 generated, as outlined herein. 

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g., 
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably 
many, of the expression profile indicates that the agent has an effect on breast cancer activity. 
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter 
25 the phenotype can be devised. With this approach, the drug target need not be known and 
need not be represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
30 differentially expressed gene as important in a particular state, screening of modulators of 
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either the expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" or 
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a 
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 
5 acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a 
preferred embodiment, the breast cancer amino acid sequence which is used to determine 
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another 
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as 

10 further described herein. 

Preferably, the breast cancer modulatory protein is a fragment of 
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble 
fragment. Preferably, the fragment includes anon-transmembrane region. In a preferred 
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the 

15 C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in 
coupling, i.e., to cysteine. 

In one embodiment the breast cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

20 Measurements of breast cancer polypeptide activity, or of breast cancer or the 

breast cancer phenotype can be performed using a variety of assays. For example, the effects 
of the test compounds upon the function of the breast cancer polypeptides can be measured 
by examining parameters described above. A suitable physiological change that affects 
activity can be used to assess the influence of a test compound on the polypeptides of this 

25 invention. When the functional consequences are determined using intact cells or animals, 
one can also measure a variety of effects such as, in the case of breast cancer associated with 
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional 
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in 
cell metabolism such as cell growth or pH changes, and changes in intracellular second 
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messengers such as cGMP. In the assays of the invention, mammalian breast cancer 
polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator 

5 and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
ELISA and the like with an antibody that selectively binds to the breast cancer polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 

10 hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the breast cancer 

15 protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 
treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 

20 individual genes and gene products (proteins). That is, having identified a particular 

differentially expressed gene as important in a particular state, screening of modulators of the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins." 
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a 

25 fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
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differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
5 expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the breast cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a breast 
cancer protein and a candidate compound, and determining the binding of the compound to 

1 0 the breast cancer protein. Preferred embodiments utilize the human breast cancer protein, 
although other mammalian proteins may also be used, e.g. for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or derivative 
breast cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the breast cancer 

15 protein or the candidate agent is non-difflxsably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microliter plate, an array, etc.). The insoluble supports may be 
made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 

20 suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 

25 with the reagents and overall methods of the invention, maintains the activity of the 

composition and is nondiffusable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 

30 the protein or agent, excess unbound material is removed by washing. The sample receiving 
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areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the breast cancer protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 

5 support and the breast cancer protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 

10 protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the 
breast cancer protein may be done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of 
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a 

1 5 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor 

20 for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand, 

25 etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 
embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 

30 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
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screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 
removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
5 compound. Displacement of the competitor is an indication that the test compound is binding 
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the breast cancer protein. In this embodiment, either component can be labeled. 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the 

10 label on the support indicates displacement. 

In an alternative embodiment, the test compound is added first, with 
incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the breast cancer protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 

1 5 support, coupled with a lack of competitor binding, may indicate that the test compound is 
capable of binding to the breast cancer protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activity of the breast cancer proteins. In 
this embodiment, the methods comprise combining a breast cancer protein and a competitor 

20 in a first sample. A second sample comprises a test compound, a breast cancer protein, and a 
competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the breast cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the breast cancer protein. 

Alternatively, differential screening is used to identify drug candidates that 
bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins. 
The structure of the breast cancer protein may be modeled, and used in rational drug design 
to synthesize agents that interact with that site. Drug candidates that affect the activity of a 
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breast cancer protein are also identified by screening drugs for the ability to either enhance or 
reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 

5 results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

10 A variety of other reagents may be included in the screening assays. These 

include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 

1 5 of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a breast cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising breast cancer 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 

20 nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 

25 chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate breast cancer agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the breast cancer protein. Once identified, similar structures are evaluated to identify critical 

30 structural feature of the compound. 
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In one embodiment, a method of inhibiting breast cancer cell division is 
provided. The method comprises administration of a breast cancer inhibitor. In another 
embodiment, a method of inhibiting breast cancer is provided. The method comprises 
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells 
5 or individuals with breast cancer are provided. The method comprises administration of a 
breast cancer inhibitor. 

In one embodiment, a breast cancer inhibitor is an antibody as discussed 
above. In another embodiment, the breast cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to 
10 those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
1 5 such as semi-solid or soft agar. The transformed cells, when transfected with tumor 

suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 
20 the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3 rd ed., 1994), 
herein incorporated by reference. See also, the methods section of Garkavtsev et al. (1996), 
25 supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
30 continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 

78 



WO 02/059377 PCT/US02/02242 



higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 

5 supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation 

10 density in non-limiting medium conditions. The percentage of cells labeling with ( 3 H)- 
thymidine is determined autoradiographically. See, Freshney (1994), supra. 

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal 
1 5 counterparts (see, e.g.. Temin, J. Natl. Cancer Insti. 37:167-175 (1966); Eagle et al, J. Exp. 
Med. 131:836-879 (1970)); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control 

20 Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth. 

25 in Biological Responses in Cancer,??. 178-184 (Mihich(ed.) 1985)). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sent Cancer Biol. (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless et al. , J. Biol. Chem. 249:4295-4305 (1974); 

30 Strickland & Beers, Biol. Chem. 251:5694-5702 (1976); Whur et al, Br. J. Cancer 42:305- 
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312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985); 
Freshney Anticancer Res. 5:111-130 (1985). 

5 Invasiveness intoMatrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate breast cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
1 0 assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in FresTmey (1994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with Matrigel or some 
other extracellular matrix constituent Penetration into the gel, or through to the distal side of 
15 the filter, is rated as invasiveness, and rated histologically by number of cells and distance 

moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

20 Effects of breast cancer-associated sequences on cell growth can be tested in 

transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which 
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out 
transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous breast cancer gene site in the mouse genome via homologous recombination. 

25 Such mice can also be made by substituting the endogenous breast cancer gene with a 

mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene, 
e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 

30 is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
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that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion {see, e.g., Capecchi et al t Science 244:1288 (1989)). Chimeric targeted mice can be 
derived according to Hogan et al. 9 Manipulating the Mouse Embryo: A Laboratory Manual, 
5 Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A . 
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987). 

Alternatively, various immune-suppressed or immune-deficient host animals 
can be used. For example, genetically athymic "nude" mouse {see, e.g., Giovanella et al, J. 
Natl. Cancer Inst 52:921 (1974)), a SCDD mouse, a thymectomized mouse, or an irradiated 

10 mouse {see, e.g., Bradley et aL 9 Br. J. Cancer 38:263 (1978); Selby et aL t Br. J. Cancer 
41:52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10 6 cells) 
injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 
normal cells of similar origin will not. In hosts which developed invasive tumors, cells 
expressing a breast cancer-associated sequences are injected subcutaneously. After a suitable 

1 5 length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two 
largest dimensions) and compared to the control. Tumors that have statistically significant 
reduction (using, e.g., Student's T test) are said to have inhibited growth. 

Polynucleotide modulators of breast cancer 

20 Antisense Polynucleotides 

In certain embodiments, the activity of a breast cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof. 

25 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
30 moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
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sulfur containing species which are known for use in the art Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the breast cancer protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
5 means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 
10 binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense 
molecule is for a breast cancer sequences in Tables 1-25, or for a ligand or activator thereof. 
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 
15 generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, e.g., Stein & Cohen {Cancer Res. 48:2659 (1988 and 
van der Krol et al (BioTechniques 6:958 (1988)). 

20 Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an 
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
have been described, including group I ribozymes, hammerhead ribozymes, hairpin 

• 25 ribozymes, RNase P, and axhead ribozymes (see, e.g., Castanotto et al, Adv. in 

Pharmacology 25: 289-3 17 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al, 
Nucl Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
30 No. 5,254,678. Methods of preparing are well known to those of skill in the art [see, e.g. t 
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WO 94/26877; Ojwang et aL, Proc. Natl Acad. Sci. USA 90:6340-6344 (1993); Yamada et 
al, Human Gene Tlierapy 1:39-45 (1994); Leavitt et aL, Proc. Natl. Acad. Sci. USA 92:699- 
703 (1995); Leavitt et aL, Human Gene Therapy 5:1 151-120 (1994); and Yamada et al t 
Virology 205: 121-126(1994)). 
5 Polynucleotide modulators of breast cancer may be introduced into a cell 

containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 

10 not substantially interfere with the ability of the ligand binding molecule to bind to its 

corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast 
cancer may be introduced into a cell containing die target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 

15 understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating breast cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous 

20 breast cancer protein. Alternatively, the methods comprise administering to a cell or 
organism a recombinant nucleic acid encoding a breast cancer protein. This may be 
accomplished in any number of ways. In a preferred embodiment, e.g. when the breast 
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing 
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by 

25 overexpressing the endogenous breast cancer gene or administering a gene encoding the 
breast cancer sequence, using known gene-therapy techniques, e.g.. In a preferred 
embodiment, the gene therapy techniques include the incorporation of the exogenous gene 
using enhanced homologous recombination (EHR), e.g. as described in PCT/US93/03868, 
hereby incorporated by reference in its entirety. Alternatively, e.g. when the breast cancer 
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sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is 
decreased, e.g. by the administration of a breast cancer antisense nucleic acid. 

In one embodiment, the breast cancer proteins of the present invention may be 
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly, 
5 the breast cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify breast cancer 
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer 
10 antibodies may be coupled to standard affinity chromatography columns and used to purify 
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the breast cancer protein. 

Methods of identifying variant breast cancer-associated sequences 

1 5 Without being bound by theory, expression of various breast cancer sequences 

is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast 
cancer genes may be determined. In one embodiment, the invention provides methods for 
identifying cells containing variant breast cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous breast cancer genes in a cell. This may be accomplished 

20 using any number of sequencing techniques. In a preferred embodiment, the invention 

provides methods of identifying the breast cancer genotype of an individual, e.g., determining . 
all or part of the sequence of at least one breast cancer gene of the individual. This is 
generally done in at least one tissue of the individual, and may include the evaluation of a 
number of tissues or different samples of the same tissue. The method may include 

25 comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene, 
i.e., a wild-type gene. 

The sequence of all or part of the breast cancer gene can then be compared to 
the sequence of a known breast cancer gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 

30 embodiment, the presence of a difference in the sequence between the breast cancer gene of 
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the patient and the known breast cancer gene correlates with a disease state or a propensity 
for a disease state, as outlined herein. 

In a preferred embodiment, the breast cancer genes are used as probes to 
determine the number of copies of the breast cancer gene in the genome. 
5 In another preferred embodiment, the breast cancer genes are used as probes to 

determine the chromosomal localization of the breast cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like axe identified in the breast 
cancer gene locus. 

10 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
' is meant a dose that produces effects for which it is administered. The exact dose will depend 

1 5 on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel et aL 9 Pharmaceutical Dosage Forms and Drug Delivery; 
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN 0824770846, 
082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of 
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is 

20 known in the art, adjustments for breast cancer degradation, systemic versus localized 

delivery, and rate of new protease synthesis, as well as the age, body weight, general health, 
sex, diet, time of administration, drug interaction and the severity of the condition may be 
necessary, and will be ascertainable with routine experimentation by those skilled in the art. 
U.S. Patent Application N. 09/687,576, further discloses the use of compositions and 

25 methods of diagnosis and treatment in breast cancer is hereby expressly incorporated by 
reference. 

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
30 preferably a primate, and in the most preferred embodiment the patient is human. 
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The administration of the breast cancer proteins and modulators thereof of the 



present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
5 some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins 
and modulators may be directly applied as a solution or spray. 



cancer protein in a form suitable for administration to a patient. In the preferred embodiment, 
the pharmaceutical compositions are in a water soluble form, such as being present as 

1 0 pharmaceutical^ acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 

15 propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, pi-toluenesulfonic acid, salicylic acid and the like. 
'Tharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 

20 manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 

25 tripropylamine, and ethanolamine. 



following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 



The pharmaceutical compositions of the present invention comprise a breast 



The pharmaceutical compositions may also include one or more of the 



WO 02/059377 PCT/US02/02242 



The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies, 

5 antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecuie(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

10 The compositions for administration will commonly comprise a breast cancer 

protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 

1 5 contain pharmaceutically acceptable auxiliary substances as required to approximate 

physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary 
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 

20 the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington 's Pharmaceutical Science (15th ed., 1980) and Goodman & Gillman, 
The Pharmacologial Basis of Therapeutics (Hardman et a/.,eds., 1996)). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0. 1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 

25 patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
in the art, e.g., Remington 's Pharmaceutical Science and Goodman and Gillman, The 

30 Pharmacologial Basis of Therapeutics, supra. 
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The compositions containing modulators of breast cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
5 amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 
the composition should provide a sufficient quantity of the agents of this invention to 

10 effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophy tactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 

1 5 treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 

recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 

It will be appreciated that the present breast cancer protein-modulating 
compounds can be administered alone or in combination with additional breast cancer 

20 modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
25 methods, reagents, vectors, and cells useful for expression of breast cancer-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
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of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
5 volume 152 (Berger), Ausubel et al, eds., Current Protocols (supplemented through 1999), 
and Sambrook et al, Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989. 



administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
breast cancer genes (including both the full-length sequence, partial sequences, or regulatory 
10 sequences of the breast cancer coding regions) can be administered in a gene therapy 
application. These breast cancer genes can include antisense applications, either as gene 



15 vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 

compositions can include, e.g., lipidated peptides (see f e.g.,Vitiello, A. et al, J. Clin. Invest. 
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") 
microspheres (see, e.g., Eldridge, et al,Molec. Immunol 28:287-294, (1991); Alonso et al, 
Vaccine 12:299-306 (1994); Jones et al, Vaccine 13:675-681 (1995)), peptide compositions 

20 contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al., Nature 
344:873-875 (1990); Hu et al, Clin Exp Immunol 113:235-243 (1998)), multiple antigen 
peptide systems (MAPs) (see, e.g., Tarn, Proc. Natl Acad. ScL U.S.A. 85:5409-5413 (1988); 
Tarn, J. Immunol Methods 196:17-32 (1996)), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 

25 vectors (Perkus, et al, In: Concepts in vaccine development (Kaufinann, ed., p. 379, 1996); 
Chakrabarti, et al, Nature 320:535 (1986); Hu et al, Nature 320:537 (1986); Kieny, et al, 
AIDS Bio/Technology 4:790 (1986); Top et al, J. Infect. Dis. 124:148 (1971); Chanda et al, 
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al, J. 
Immunol Methods. 192:25 (1996); Eldridge et al, Sem. Hematol. 30:16 (1993); Falo et al, 

30 Nature Med. 7:649 (1995)), adjuvants (Wairen et al, Annu. Rev. Immunol 4:369 (1986); 



In a preferred embodiment, breast cancer proteins and modulators are 



therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

Breast cancer polypeptides and polynucleotides can also be administered as 
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Gupta et aL, Vaccine 1 1:293 (1993)), liposomes (Reddy et aL, J. Immunol 148:1585 (1992); 
Rock, Immunol. Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulrner, et aL, 
Science 259:1745 (1993); Robinson et aL, Vaccine 11:957 (1993); Shiver et aL, In: Concepts 
in vaccine development (Kaufinann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev. 
5 Immunol. 12:923 (1994) and Eldridge et al y Sem. Hematol 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 

10 or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 

15 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

20 Vaccines can be administered as nucleic acid compositions wherein DNA or 

RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff et. aL, Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 

25 include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
(see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

30 include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
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vaccinia virus, e.g., asa vector to express nucleotide sequences that encode breast cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
5 Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et al 7 Nature 35 1 :456-460 (1991). A wide variety of other vectors useful 
for therapeutic administration or immunization e.g. adeno and adeno-associated virus vectors, 
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, 
will be apparent to those skilled in the art from the description herein {see, e.g., Shata et aL 3 

1 0 Mot Med Today 6:66-71 (2000); Shedlock et al 9 JLeukoc Biol 68:793-806 (2000); Hipp et 
aljn Vivo 14:571-85 (2000)). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing a breast cancer gene or portion of a breast cancer gene under the control of a 
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient. 

1 5 The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins, 
but more preferably encodes portions of the breast cancer proteins including peptides derived 
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For 
example, breast cancer-associated genes or sequence encoding subfragments of abreast 

20 cancer protein are introduced into expression vectors and tested for their immunogenicity in 
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This 
procedure provides for production of cytotoxic T cell responses against cells which present 
antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an 

25 adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment breast cancer genes find use in generating 
animal models of breast cancer. When the breast cancer gene identified is repressed or 

30 diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
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to the breast cancer gene will also diminish or repress expression of the gene. Animal models 
of breast cancer find use in screening for modulators of a breast cancer-associated sequence 
or modulators of breast cancer. Similarly, transgenic animal technology including gene 
knockout technology, e.g. as a result of homologous recombination with an appropriate gene 
5 targeting vector, will result in the absence or increased expression of the breast cancer 
protein. When desired, tissue-specific expression or knockout of the breast cancer protein 
may be necessary. 

It is also possible that the breast cancer protein is overexpressed in breast 
cancer. As such, transgenic animals can be generated that overexpress the breast cancer 
10 protein. Depending on the desired expression level, promoters of various strengths can be 
employed to express the transgene. Also, the number of copies of the integrated transgene 
can be determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of breast cancer and are 
additionally useful in screening for modulators to treat breast cancer. 

15 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic 

20 acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 

ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecules 
inhibitors of breast cancer-associated sequences etc, A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 

25. (i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
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like. Such media may include addresses to internet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
breast cancer-associated sequences. Such kits can be prepared from readily available 

5 materials and reagents. For example, such kits can comprise one or more of the following 
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and 
instructions for testing breast cancer-associated activity. Optionally, the kit contains 
biologically active breast cancer protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 

1 0 the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 
parameters in disease which may be identified in historical or outcome data. 

15 EXAMPLES 

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample using TRIzol Reagent 

20 The sample weight is first estimated. The tissue samples are homogenized in 

1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of 
the generator/probe used depends upon the sample amount. A generator that is too large for 
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A 
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g. Fill 

25 tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than 
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization. 

Tissues should be kept frozen until homogenized. The TRIzol is added 
directly to the frozen tissue before homogenization. Following homogenization, the insoluble 
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a 

30 Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared 
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homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to 

-70°C for at least one month or else continue with the purification. 

The next process is phase separation. The homogenized samples are incubated 
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1ml of TRIzol reagent is 
5 added to the homogenization mixture. The tubes are securely capped and shaken vigorously 
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temp, for 
2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 min. at 4oC. 

The next process is RNA Precipitation. The aqueous phase is transferred to a 
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then 
10 0.5 ml of isopropyl alcohol is added per 1ml of TRIzol reagent used in the original 

homogenization. Then, the tubes are securely capped and inverted to mix. The samples are 
then incubated at room temp, for 10 minutes an centrifuged at 6500 rpm in Sorvall for 20 
min. at 4°C. 

The RNA is then washed. The supernatant is poured off and the pellet washed 
1 5 with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in 
the initial homogenization. The tubes are capped securely and inverted several times to 
loosen pellet without vortexing . They are next centrifuged at <8000 rpm (<7500 x g) for 5 
minutes at 4°C. 

The RNA wash is decanted. The pellet is carefully transferred to an 
20 Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it 
in if necessary). Tube(s) sizes for precipitating the RNA depending on the working volumes. 
Larger tubes may take too long to dry. Dry pellet. The RNA is then resuspended in an 
appropriate volume (e.g., 2 -5 ug/ul) of DEPC H 2 0. The absorbance is then measured. 

The poly A+ mRNA may next be purified from total RNA by other methods 
25 such as Qiagen' s RNeasy kit. The poly A + mRNA is purified from total RNA by adding the 
oligotex suspension which has been heated to 37°C and mixing prior to adding to RNA. 
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2 x 
Binding Buffer at 65°C. The the total RNA is mixed with DEPC-treated water, 2 x Binding 
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Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next 
incubated for 3 minutes at 65°C and 10 minutes at room temperature. 

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably, 
at a "soft setting," The supernatant is removed without disturbing Oligotex pellet. A little bit 
of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until 

satisfactory binding and elution of poly A + mRNA has been found. 

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted 
onto the spin column and centrifuged at full speed (soft setting if possible) for 1 minute. 

Next, the spin column is transferred to a new collection tube and gently 
resuspended in Wash Buffer OW2 and centrifuged as described herein. 

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul 
of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up 
and down. The centrifugation is repeated as above and the elution repeated with fresh elution 
buffer or first eluate to keep the elution volume low. 

The absorbance is next read to determine the yield, using diluted Elution « 

Buffer as the blank. 

Before proceeding with cDNA synthesis, the mRNA is precipitated before 
proceeding with cDNA synthesis, as components leftover or in the Elution Buffer from the 
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA. 
0.4 vol. of 7.5 M NH40Ac + 2.5 vol. of cold 100% ethanol is added and the preparation 
precipitated at -20°C 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at 
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80% 
ethanol (-20°C) and then centrifuged at 14,00046,000 x g for 5 minutes at room temperature. 
The80% ethanol wash is then repeated. The last bit of ethanol from the pellet is then dried 
without use of a speed vacuum and the pellet is then resuspended in DEPC H2O at lug/ul 
concentration. 

Alternatively the RNA mav he purified using other methods ( e.g., Qiagen' s RNeasy kit). 
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No more than 100 ug is added to the RNeasy column. The sample volume is 
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol 
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an 
RNeasy mini spin column for centrifugation (15 sec at >10,000 rpm). If yield is low, reapply 
5 the flowthrough to the column and centrifuge again. 

Then, transfer column to a new 2 ml collection tube and add 500 ul Buffer 
RPE and centrifuge for 15 sec at >10,000rpm. The flowthrough is discarded. 500 ul Buffer 
RPE and is then added and the preparation is centriuged for 15 sec at >i 0,000 rpm. The 
flowthrough is discarded, and the column membrane dried by centrifuging for 2 min at 
10 maximum speed. The column is transferred to a new 1.5-ml collection tube. 30-50 ul of 

RNase-free water is applied directly onto column membrane. The column is then centrifliged 
for 1 min at >10,000 rpm and the elution step repeated. 

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol. 

15 

First Strand cDNA Synthesis 

The first strand can be make using using Gibco's "Superscript Choice System 
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of polyA+ 
mRNAL For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of 
20 Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA 
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol 

T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4ul 5X 1 st Strand 
. Buffer, 2 ul of 0.1M DTT, and 1 ul of lOmM dNTP mix. The preparation is then incubated at 
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1 
25 hour. 

Second Strand Synthesis 

For the second strand synthesis, place 1st strand reactions on ice and add: 91 

ul DEPC H 2 0; 30 ul 5X 2nd Strand Buffer; 3 ul lOmM dNTP mix; 1 ul 10 U/ul E.coli DNA 

30 Ligase; 4 ul 10 U/ul E.coli DNA Polymerase; and 1 ul 2 U/ul RNase H, Mix and incubate 2 
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hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add 10 ul of 0.5M 
EDTA. 

Cleaning up cDNA 

5 The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25:24: 1) 

and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed. 
The cDNA mix is then transferred to PLG tube. An equal volume of 
phenol:chloroform:isamyl alcohol is then added, the preparation shaken vigorously (no 
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is 
1 0 transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH40Ac and 2.5X 
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20 
min, maximum speed. The supernatant is removed, and the pellet washed with 2X with cold 
80% ethanol. As much ethanol wash as possible should be removed before air drying the 
pellet; and resuspending it in 3 ul RNase-free water. 

15 

In vitro Transcription (TVT) and labeling with biotin 

In vitro Transcription (IVT) and labeling with biotin is performed as follows: 
Pipet 1.5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul 
T7 lOxATP (75 mM) (Ambion); 2 ul T7 lOxGTP (75 mM) (Ambion); 1.5 ul T7 lOxCTP (75 

20 mM) (Ambion); 1.5 ul T7 lOxUTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-ll-UTP 

(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul lOx T7 
transcription buffer (Ambion); and 2 ul lOx T7 enzyme mix (Ambion). The final volume is 
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol 

25 handbook. The cRNA often needs to be ethanol precipitated by resuspension in a volume 
compatible with the fragmentation step. 

Fragmentation is performed as follows. 15 ug of labeled RNA is usually 
fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 

30 the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
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RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
of the transcript size range 

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50 ng/ul final cone); 50 pM 948-b control 
oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0.1 mg/ml herring sperm DNA; 
0.5 mg/ml acetylated BSA; and 300 ul with lxMES hyb buffer. 

The hybridization reaction is conducted with non-biotinylated IVT (purified 
by RNeasy columns) (see example 1 for steps from tissue to IVT): The following mixture is 
prepared: 

IVT antisense RNA; 4 ug: ^1 

Random Hexamers (1 ^ig/|il): 4 ^1 

H 2 0: Mi 

14 ul 

Incubate the above 14 (il mixture at 70°C for 10 min.; then put on ice. 

The Reverse transcription procedure uses the following mixture: 
0.1MDTT: 3 fil 

50XdNTPmix: 0.6^1 
H 2 0: 2.4^1 
Cy3 or Cy5 dUTP (ImM): 3 \il 
SS RT II (BRL): 1^1 

16 \il 

The above solution is added to the hybridization reaction and incubated for 30 min., 42°C. 

Then, 1 ul SSII is added and incubated for another hour before being placed on ice. 
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The 50X dNTP mix contains 25mM of cold dATP, dCTP, and dGTP, lOmM 
of dTTP and is made by adding 25 ul each of lOOmM dATP, dCTP, and dGTP; 10 pi of 
lOOmM dTTP to 15 ul H 2 0. 1 

RNA degradation is performed as follows. Add 86 pi H 2 0, 1.5 pi 1M NaOH/ 
2 mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 pi TE/sample spin at 7000 g 
for 10 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in 500 ul buffer PB and proceed using Qiagen protocol. For DNAse 
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min. 
Incubate at 5 min 95°C to denature the DNAse. 

Sample preparation 

For sample preparation, add Cot-1 DNA, 10 pi; 50X dNTPs, 1 ul; 20X SSC, 2.3 |xl; Na pyro 
phosphate, 7.5 pi; 10 mg/ml Herring sperm DNA; 1 ul of 1/10 dilution to 21.8 final vol. Dry 
in speed vac. Resuspendin 15 plH 2 0. Add 0.38 pi 10% SDS. Heat 95°C, 2 min and slow 
cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. Washing after 
the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X SSC+0.75ml 10% SDS in 
250ml H 2 0; IX SSC: 5 min., 12.5 ml 20X SSC in 250ml H 2 0; 0.2X SSC: 5 min., 2.5 ml 20X 
SSC in 250ml H 2 0. Dry slides and scan at appropiate PMT's and channels. 
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TABLE 1: Figure 1 from BRCA 001 US 

Table 1 shows genes, (incorporated in their entirety here and throughout the application 
where primekeys are provided), downregulated in tumor tissue compared to normal breast 
tissue. 



10 



15 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R 1 : Ratio of normal breast tissue to tumor 



Pkey ExAccn UnigenelD UnlgeneTitte 

20 100472 D90084 Hs.1023 pyruvate dehydrogenase (lipoamide) alpha 

100499 T51986 Hs.283108 hemoglobin, gamma G 
100545 M55405 gb:Homo sapiens mucin (MUC-3) mRNA, part 

100549 BE142019 Hs.222056 Homo sapiens CDNAFU1 1572 fis. done HE 

100613 X52078 Hs.101047 transcription factor 3 (E2A immunogtobul 

25 100635 BE259039 Hs.129953 Ewing sarcoma breakpoint region 1 

100645 X16841 Hs.167988 neural cefl adhesion molecule 1 
100654 A03758 NM_000477^Homo sapiens albumin (ALB), m 

100702 L27065 gfcHuman neurofibromatosis 2 (NF2) mRNA, 

1 00915 M60832 Hs.249239 collagen, type VIII, alpha 2 

30 100971 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 

101125 AJ250562 Hs.B2749 transmembrane 4 superfamfly member 2 

101166 M90424 Hs.2099 lipocalin 1 (protein migrating faster In 

101184 NMJ01674 Hs.460 activating transcription factor 3 

101336 NMJ)06732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 

35 101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 
101447 M21305 gb:Human alpha satellite and satellite 3 

101461 N98569 Hs.76422 phosphofipase A2, group HA (platelets, 

101511 M27826 Hs.267319 endogenous retroviral protease 

101634 AV650262 Hs.75765 GR02 oncogene 

40 101736 M74447 Hs.502 transporter 2, ATP-binding cassette, sub 
102208 U22961 gb:Human mRNA clone with similarity to L 

102297 NM.001504 Hs.198252 G protein-coupled receptor 9 

102450 U48251 Hs.75871 protein kinase C binding protein 1 - - 

102515 U89337 Hs.169886 tenascinXB 

45 102571 U60115 Hs.239069 four and a half UM domains 1 

102800 AA313538 gb:EST185419 Colon carcinoma (HCC)ceU 

102857 NM.006744 Hs.76461 retinol-binding protein 4, interstitial 

102990 AA829286 Hs.332053 serum amyloid A 1 

103434 X98085 Hs.54433 tenasdn R (restricts, janusinj 

50 103747 AA081995 gb:zn26d06.r1 Stratagene neuroepithefium 

103750 AA126129 gb:zm78c07.r1 Stratagene neuroepitheiium 

103812 AA137107 Hs.326391 Homo sapiens, clone MGC:16638, mRNA, com 

103851 AA326216 Hs.8719 hypothetical protein MGC1136 

104080 AB041036 Hs.57771 kallikrein 1 1 (KLK11 ; TLSP; PRSS20; hipp 

55 104093 R50727 Hs.336970 ESTs 

104106 AA422123 gfczv26h12.r1 Soares_NhHMPu_S1 Homo sapi 

104109 AL353957 Hs.284181 hypothetical protein DKF2p434P0531 

104250 F06638 Hs.12440 Homo sapiens done 24734 mRNA sequence 
104340 AA426189 gb:zw11e09.r1 Soares_NhHMPu S1 Homosapl 

60 104492 N73185 Hs,94285 EST 

104506 N91071 Hs.109650 ESTs 

104511 N99542 Hs.572 orosomucoid 1 

104532 AI498763 Hs.203013 hypothetical protein FL J 12748 

100 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



104536 R24024 
104572 Y11312 
104659 AW969769 
104677 AA009764 
104711 AA017245 
104731 AA01930Q 
104764 AI039243 
105005 AI2982Q8 
105036 M130390 
105105 R61532 
105231 AW970043 
105239 AA221036 
105921 AA421973 
105957 BE242857 
106052 N79885 
106119 AL359624 
106181 AI803651 
106194 AW976171 
106283 A1085846 
106379 AL042069 
106451 AW235928 
106491 AA135688 
106700 AA906434 
106782 AW054886 
106851 AI458623 
106870 AI983730 
106892 AI347578 
106954 AF128847 
106991 AJ223811 
107103 AI446183 
107124 AB006532 
107148 AI005036 
107214 AF127026 
107242 AB020672 
107331 AI905985 
107351 U51704 
107423 W26652 
107447 W28516 
107451 AL042425 
107453 A1092790 
107459 W38002 
107683 N53167 
107711 W96141 
107754 AA017462 
107757 BE621721 
107864 AA025060 
107872 BE271708 
107888 AA025836 
107997 AL049176 
108056 AA043675 
108081 AA093668 
108113 AA012881 
108238 AA059473 
108257 AA677927 
108335 AA070500 
108351 M071193 
108382 NM06770 
108392 AA075124 
108441 AA079079 
108446 AA085383 
108497 AA074897 
108604 AA934589 
108662 AF1 17646 
108706 AA121820 
108738 AA126583 
108827 AI273692 
109123 AI028376 



Hs.158101 
Hs.132463 
Hs.105201 
Hs.190380 
Hs.32794 
Hs.125070 
Hs.278585 
Hs.28805 
Hs.25549 
Hs.87016 
Hs.238039 

Hs.169119 
Hs.27021 
Hs.6382 
Hs.1 1387 
Hs.191608 
Hs.286194 
Hs.25522 
Hs.1 19021 
Hs.313182 
Hs.10083 
Hs.3776 
Hs-25682 

Hs.26530 
Hs.124015 
Hs.204038 
Hs.30127 
Hs.9572 
Hs.31442 
Hs.334305 
Hs.5394 
Hs.175411 
Hs.1 11805 
Hs.323428 
Hs.6163 
Hs.19210 
Hs.283976 
Hs.334703 

Hs.47623 
Hs.220687 
Hs.269244 
Hs.280792 
Hs.61246 
Hs.95110 
Hs.191637 
Hs.82223 
Hs.62633 
Hs.28578 
Hs.72531 
Hs.66783 
Hs.144269 



Homo sapiens cDNA RJ 14673 fis, clone NT 
phosphoinositide-3-kinase. class 2, beta 
ESTs 
ESTs 
ESTs 

ESTs, Moderately similar to 154374 gene 
ESTs 
ESTs 

hypoftetka! protein FU20B98 
hypothetical protein FU22938 
hypothetical protein FLJ 11090 
gb:zr03f 12.r1 Stratagene NT2 neuronal pr 
ESTs, Weakly similar to T25731 hypotheti 
hypothetical protein FU11159 
ESTs, Highly similar to T00391 hypotheti 
KIAA1453 protein 
ESTs 

hypothetical protein RJ22233 
KIAA1808 protein 
DKFZP434N061 protein 
ESTs 

Homo sapiens, clone IMAGE:4 139786, mRNA, 
zinc finger protein 216 
Homo sapiens mRNA forK!AA1863 protein, 
gb:tk04g09.x1 NCL.CGAPJ.u24 Homo sapiens 
serum deprivation response (phosphatidyl 
hypothetical protein MGC2605 



hypothetical protein 

ESTs. Highly similar to CYA5_HUMAN ADENY 
RecQ protein-like 4 
GS1999fuII 
myosin IA 
K1AA0865 protein 
ESTs 

ESTs, Moderately similar to ALU8JHUMAN A 
PTEN induced putative kinase 1 
hypothetical protein MGC11308 
hyptheticai protein PR02389 
hypothetical protein FU14529 
Empirically selected from AFFX single pr 
ESTs 
ESTs 
ESTs 

hypothetical protein FU12387 simitarto 
ESTs 

ESTs, Weakly similar to A55943 1-phospha 
ESTs 

chordin-Bke 
ESTs 

muscleblind (Drosophila)-like 
hypothetical protein FU11838 
EST 
ESTs 

gb:zm70h03.s1 Slratagene neuroepithelhim 
gb:zf79b12.s1 Soares_pineaLglan d -N 3HPG 
Hs.67726 macrophage receptor with collagenous str 
gb:zm86a01.s1 Stratagene ovarian cancer 
gb:zm97c09.s1 Stratagene colon HT29 (937 
gb:zn13g03^1 Stratagene hNT neuron (937 
gb:zm85a05.r1 Stratagene ovarian cancer 
Hs.49696 ESTs 

Hs.1 56637 Cas-Br-M (murine) ectropic retroviral tr 
Hs.74569 WAA0842 protein 
Hs.158725 ESTs 
Hs.110470 ESTs 
Hs.73232 ESTs 
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109389 
109546 
109919 
110006 
110141 
110354 
110433 
110448 
110455 
110540 
110553 
110976 
110987 
111158 
111168 
111187 
111307 
111400 
111498 
111651 
111738 
111803 
111995 
112071 
112204 
112258 
112490 
112588 
112654 
112784 
112817 
112885 
112913 
113149 
113174 
113203 
113299 
113367 
113457 
113563 
113574 
113776 
113790 
113807 
113958 
114211 
114254 
114349 
114449 
114464 
114576 
114624 
114844 
114906 
115624 
115666 
115712 
115889 
115949 
116107 
116180 
116267 
116291 
116527 
65 116659 
116708 
117058 
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AA101325 Hs.86154 hypofhefical protein RJ 12457 10 

F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr 5 

R40604 Hs.129539 ESTs, WeaWy similar to MCAT_HUMAN MITOC 10 

AI094674 Hs.30524 ring Bnger protein 24 10 

H46749 Hs.31540 ESTs 10 

W22165 Hs.22586 ESTs 5 

AW294162 Hs.301062 UDP-N-acetyMpha-f>ga!actosamine:poiyp 10 

H51276 Hs.13526 hypoiheflcai protein FU 12688 10 

H52576 gb:yt85e08.r1 Soaresjtoeal jland JI3HPG 5 

H72639 Hs.1 67608 ESTs 5 

H60593 Hs.124990 ESTs 10 

AL044174 Hs.159526 patched (Drosophila) homolog 10 

AI753316 Hs.26034 ESTs 5 

N66616 Hs.138629 Rsapiens mRNA for subtelomeric repeats 5 

AI798376 gb*34b07a1 NC|_CGAP_Ov23 Homo sapiens 10 

AJ224864 Hs.9688 leukocyte membrane antigen 5 

AA641636 Hs.37477 ESTs, Weakly similar to T46908 hypotheti 5 

R00144 Hs.189771 ESTs 10 

AI168511 gb:ow90h09.s1 Soares Jetaijiver_spleen_ 10 

R16733 Hs.20499 ESTs 10 

R26065 gb:yh39d03.s1 Soares placenta Nb2HP Homo 5 

AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10 

R42333 Hs.302292 ESTs 10 

AL117490 Hs.47225 Ras-associated protein Rap1 10 

NM_006668 Hs.25121 cytochrome P450, subfamily 46 (chotester 10 

R51889 Hs.24990 ESTs 5 

R31094 Hs.24378 ESTs 10 

R77302 gb.7i75h08.s1 Soares placenta Nb2HP Homo 10 

BE618629 Hs.268809 ESTs 5 

T98628 Hs.191290 ESTs 5 

AI057205 Hs.14584 ESTs 5 

AA581428 Hs.5021 EST 10 

T16837 Hs.4241 ESTs 5 

T51588 gb:yb27e06.s1 Stratagene fetal spleen (9 10 

T54659 Hs.301755 Homo sapiens cDNA FU11465 lis, done HE 5 

AA743563 Hs.10305 ESTs 5 

AW207424 Hs.332594 ESTs 10 

N92359 Hs.14518 ESTs, Moderately similar to A48752 B-cel 10 

R16763 Hs.268679 ESTs 5 

AA913635 Hs.326413 Homo sapiens cONA FU20812 fts, done AO 10 

R06874 Hs.268628 ESTs, Moderately simiJar to ALU 1_HUMAN A 5 

A1791905 Hs.95549 hypothetical protein 10 

A1244311 Hs.26912 ESTs 10 

W07586 Hs.8045 ESTs 3 

W86195 gb:zh54e05.s1 Soares JetaLGver_sp!een_ 10 

Z39319 Hs.27347 EST 10 

AB018263 Hs.180338 tumor necrosis factor receptor superfaml 5 

AA745978 Hs.28273 ESTs 5 

AA020736 gb:ze63b11.s1 Soares retina N2b4HR Homo 5 

AA034378 Hs^67319 endogenous retroviral protease 5 

AA065096 gb:2m50a02^1 Stratagene fibroblast (937 5 

AA081 507 gbznQ5b10 A Stratagene hNT neuron (937 5 

AA234826 Hs.87386 EST 5 

AA234462 Hs.87350 ESTs 5 

AK000725 Hs.50579 hypothetical protein FU2071 8 3 

AF173081 Hs.178215 Vertebrate UN7 homolog 1. Tax interacti 5 

AB020649 Hs.74569 KIAA0842 protein 5 

AA398841 Hs.39850 hypothetical protein FU205 17 10 

AI478427 Hs.43125 esophageal cancer related gene 4 protein 10 

AL133916 Hs.172572 hypothetical protein FU2Q093 10 

AA463902 Hs.13522 ESTs, Weakly similar to I38022 hypotheti 5 

AW968703 Hs.30085 hypothetkaJ protein FU231 86 5 

AW4 10377 Hs.41502 hypotheflca! protein FU21 276 5 

AW194253 Hs.68607 ESTs 10 

BE314852 Hs.168694 Homo sapiens ctone 23763 unknown mRNA, p 5 

F10528 Hs.70001 ESTs, Moderately similar to JC6169 nude 5 

AW801806 gb:IL5-UM0070-110400-062-g07UM0070Homo 5 
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117151 AI803656 
117226 N20468 
117323 AI472863 
117571 N34417 
117624 N26627 
117673 N40551 
117847 N49285 
117877 AW263476 
117919 BE222341 
118049 N53145 
118413 AW955698 
118613 AI078236 
118664 N70907 
118858 AL122040 
118902 AA993527 
119039 AI160570 
119159 AF142419 
119216 AA514422 
119317 AK002001 
119366 T77892 
119378 781824 
119528 W38051 
119792 AL049798 
119800 AF086332 
119817 AF088061 
119835 AF086429 
119923 AW803308 
119961 U34249 
120379 AL042725 
120931 AW136934 
121037 AA907743 
121282 AA401695 
121382 AA405783 
121764 AA421452 
122034 AK000229 
122441 AA447555 
122756 AA458945 
122771 AW135093 
123601 AA609122 
123623 AI024595 
123941 M621529 
124215 H62570 
124276 H83465 
124680 AK001527 
125099 NMJM312 
125121 T98199 
125188 BE299567 
125284 NMJ02666 
125906 BE256206 
128484 AA485421 
128511 NMJJ02250 
128538 R44214 
128606 C16161 
128850 AA193106 
128870 H39537 
128903 AW150717 
128931 N62889 
129001 AA443323 
129091 AA056483 
129101 NWL013403 
129146 AL117472 
129213 A1146494 
129228 U40714 
129265 AA530892 
129285 BE617015 
129346 AF1 10141 
129368 NMJW3877 



Hs.42373 ESTs 

gb:yx39b10.s1 Scares melanocyte 2NbHM Ho 
Hs.43387 ESTs 
Hs.44584 ESTs 

Hs.82364 ESTs, Weakly similar to JC4124 pregnancy 
Hs.184043 Homo sapiens Ets-1 binding protein (E1B) 
Hs.182391 ESTs 

Hs.44268 myelin gene expression factor 2 
Hs.279472 ESTs 

gb:yv55f09.s1 Soares fetal liver spleen 
Hs.90960 ESTs 
Hs.49688 ESTs 
Hs.230619 EST 

Hs.102981 Homo sapiens mRNA; cDNA DKFZp434G1972 (f 
Hs.293907 hypothetical protein RJ23403 
Hs.252097 pregnancy specific beta-1-glycoprote!n 6 
Hs.15020 homolog of mouse quaking QK1 (KH domain 
Hs.221849 ESTs 

Hs.51305 v-maf musculoaponeurotic fibrosarcoma (a 
gb:yd20f04.s1 Soares fetal Twer spleen 

Hs.90949 EST 

Empirically selected from AFFX single pr 

Hs.80552 dermatopontin 

Hs!58314 ESTs 

Hs.159690 ESTs 

Hs.58429 ESTs 

- Hs.62954 ferritin, heavyporypeptide 1 
Hs.337461 Human putative zinc finger protein (ZNFB 

gb:DKFZp434B1822_/1 434 (synonym: htes3) 
Hs.97162 ESTs 
Hs.142373 ESTs 
Hs.97334 ESTs 

Hs.111939 Homo saolens cONA FU20470 fis, done KA 
Hs.164851 ESTs, Weakly similar to KIAAQ926 protein 
Hs.98017 Homo sapiens cONA FU20222 fis. clone CO 
Hs.99116 EST 
Hs.95898 ESTs 

Hs.97282 ESTs, Highly similar to G100JHUMAN 110 K 
Hs.1 12645 Homo sapiens mRNA; cDNA DKFZp43402472 (f 
Hs.97508 a disintegrin and metailoproteinase doma 

gb:af47aOZs1 SoaresJotaLfetus_Nb2HF8_ 
gb^r44a01.r1 Soares fetal Ever spleen 
gb:ys91a1 1 .s1 Soares retina N2b5HR Homo 
Hs.163953 hypothetical protein FU10665 
Hs.1 12377 cortic al thymocyte receptor (X. iaevis 
Hs.48403 hypothetical protein FU10847 
Hs.271749 ESTs. Moderately similar to ALU8JHUMAN A 
Hs.103253 periUpin 

Hs.17775 p75NTR-associated cell death executor; o 
Hs.270503 ESTs, Weakly similarto ALU7_HUMAN ALU S 
Hs.10082 potassium intermediate/small conductance 
Hs.101189 ESTs 

Hs.283040 hypothetical protein PR02543 
Hs.180817 chromosome 1 1 open reading frame 23 
Hs.75309 eukaryotic translation elongation factor 
Hs.296176 STAT induced STAT inhfoitor 3 
Hs.107242 Homo sapiens cDNA FU12965 fis, clone NT 
Hs.107812 BPOZ protein 

Hs.301463 Human Chromosome 16 BAC clone CIT987SK-A 
Hs.108665 zinedin 

Hs.108924 SH3-<lomain protein 5 (ponsln) 

Hs.109525 ESTs, Weakly similar to IRX2.HUMAN IROQU 

Hs.239307 tyrosyURNA synthetase 

Hs.171695 dual specificity phosphatase 1 

Hs.1 1006 ESTs, Moderately similar to T17372 plasm 

Hs.288908 WAS protein family, member 2 

Hs.110776 STAT induced STAT inhIbitor-2 
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129371 
129381 
129440 
129441 
5 129516 
129554 
129684 
129702 
129778 
10 129893 
129928 
129973 
129977 
130014 
15 130085 
130089 
130162 
130243 
130315 
20 130339 
130400 
130436 
130478 
130480 
25 130494 
130563 
130589 
130606 
130634 
30 130683 
130689 
130716 
130718 
130722 
35 130798 
130840 
131184 
131261 
131282 
40 131328 
131340 
131341 
131406 
131489 
131543 
131692 
131753 
131756 
131785 
131815 
131619 
131828 
131888 
131927 
55 131949 
132115 
132177 
132296 
132426 
60 132477 
132675 
132796 
132898 
132905 
65 132953 
133116 
133120 



45 



50 



X06328 

AW245805 

W37944 

BE061069 

AF020038 

BE222078 

8E622468 

A1304966 

AK001676 

AK000956 

A1338993 

AJ251760 

NR.00Q399 

NMJW1158 

M62402 

AA452006 

W80711 

D88435 

AI241084 

AA435746 

V00517 

NNL001928 

X72308 

BE222978 

AW390834 

BE270472 

AL1 10226 

AI652143 

AI769067 

AA993269 

NMJJ06691 

AA232075 

AF263462 

N41322 

M81349 

BE048821 

AB040935 

AA360419 

X03350 

AW939251 

AK000393 

AF1 10908 

H83294 

BE394648 

AW966681 

BE559681 

AA829286 

AA443966 

H69342 

AA021258 

BE244961 

AJ000263 

AW294659 

AJ003112 

AK000010 

H81604 

X80818 

AA467752 

AW1 18072 

S68874 

A1291496 

NIVL006283 

W28548 

NM_004235 

BE175645 

BE563966 

NM.003278 



Hs.1 10802 

Hs.1 10903 

Hs.4007 

Hs.301943 

Hs.11223 

Hs.113069 

Hs.11924 

Hs.12035 

Hs.12457 

Hs.13209 

Hs.134535 

Hs.273385 

Hs.1395 

Hs.143102 

Hs.274313 

Hs.333199 

Hs.319946 

Hs.153227 

Hs.154353 

Hs.283108 

Hs. 155597 

Hs.251526 

Hs.15760 

Hs.75874 

Hs.279900 

Hs.16441 

Hs.288382 

Hs.127824 

Hs.17872 

Hs.17917 

Hs.18259 

Hs.18376 

Hs.18441 

Hs.1955 

Hs.20144 

Hs.23954 

Hs.171776 

Hs.4 

Hs.25647 

Hs.25817 

Hs.297660 

Hs.284122 

Hs.27414 

Hs.41639 

Hs.30736 

Hs.332053 

Hs.31595 

Hs.26320 

Hs.32753 

Hs.173103 

Hs.278658 

Hs.34054 

Hs.34780 

Hs.258798 

Hs.178471 

Hs.178078 

Hs.195161 

Hs.89981 

Hs.170917 

Hs.5476 

Hs.173159 

Hs324829 

Hs.7934 

Hs.321264 

Hs.6529 

Hs.65424 



von Willebrand factor 
claudin 5 (transmembrane protein deleted 
Sarcolemmal-assodated protein 
KIAA0467 protein 

isodtrate dehydrogenase 1 (NADP+), soiu 
ESTs 

ESTs, WeaWy similar to I38022 hypotheli 
ESTs, WeaWy similarto I38022 hypotneti 
hypothetical protein FLK081 4 
hypothetical protein FU10094 
ESTs 

guanine nucleotide binding protein (G pr 
early growth response 2 (Krox-20 (Drosop 
amine oxidase, copper containing 2 (reti 
Insulin-Eke growth factor binding prole 
ESTs 

Homo sapiens mRNA for K1AA1727 protein, 
cycfin G associated kinase 
nonselective sodium potassium/proton exc 
gb:zt79e03.s1 Scares Jestis_NHT Homo sap 
hemoglobin, gamma G 
0 component of complement (adipstn) 
small inducible cytokine A7 (monocyte ch 
MYG1 protein 



HSPC015 protein 

DKFZP434H204 protein 

hypothetical protein FU13111 

ESTs, WeaWy similar to T28770 hypotheti 

Homo sapiens, done IMAGE:3875012, mRNA 

extracellular link domain-containing 1 

XPA binding protein 1; putative ATP(GTP) 

K1AA1319 protein 

ESTs 

serum amyloid A4, constitutive 

small inducible cytokine subfamily A (Cy 

cerebral cell adhesion molecule 

inositol(myo)-1(or4)-monophosphatase 1 

alcohol dehydrogenase 16 (class I), beta 

v-fos FBJ murine osteosarcoma viral onco 

BTB (POZ) domain containing 2 

TNF receptor-assodated factor 3 

Wnt inhibitory factor-1 

hypothetical protein 

programmed cell death 2 

K1AA0124 protein 

serum amyloid A1 

ESTs 

TRABIO protein 
ESTs 

FE65-LIKE2 

keratin, hair, basic, 6 (monilethrix) 

Homo sapiens cDNA: FU22488 fis, clone H 

doubtecortex; Bssencephary, X-finked (d 

hypothetical protein FU20003 

K1AA0798 gene product 

glutamate receptor, metabotropic 4 

ESTs 

dlacytgrycerol kinase, zeta (104kD) 
prostaglandin E receptor 3 (subtype EP3) 
Homo sapiens, done IMAG£:35301 23. mRNA, 
transforming, addic coHed-coil contain 
ESTs 

KruppeHike factor 4 (gut) 
LBP protein 32 

ESTs, Weakly similar to 178885 serine/th 
tetranecfin (plasmlnogen-binding protein 

104 
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133139 AF052138 Hs.6580 

133163 AA668224 Hs.6634 

133268 AW956781 Hs.293937 

133272 NMJQ02776 Hs.69423 
5 133379 AA207059 

133407 AF017987 Hs.7306 

133552 H21497 Hs.7471 

133702 L02321 Hs75652 

133719 H26904 Hs.75736 

10 133731 N71725 Hs.272572 

133789 T85626 Hs.76239 

134007 AF072441 Hs.7840 

134055 D86062 Hs.182423 

134111 A1372588 Hs.8022 

15 134117 M081846 Hs.7921 

134177 BE243319 Hs.79672 

134308 AW905827 Hs.81454 

134361 BE549343 Hs.82208 

134369 AF207664 Hs.8230 

20 134449 L34155 Hs.83450 

134467 AI190413 Hs.8373 
134496 M64936 

134510 NMJJQ2757 Hs.250870 

134550 M26315 Hs.85258 

25 134577 BE244323 Hs.85951 

134591 U73394 Hs.166085 

134678 AL008583 Hs.182595 

134728 D10216 Hs.89394 

134758 NM_000078 Hs.89538 

30 134786 T29618 Hs.69640 

134912 T87521 Hs.261457 

134963 NM.003394 Hs.91985 

134969 H22570 Hs.172572 

135001 AA302517 Hs.92732 

35 135066 X04430 Hs.93913 

135173 AL036557 Hs.95910 

135197 U76456 Hs.190787 

135219 AB002361 Hs.96633 

135250 U83171 Hs.97203 

40 135304 AA416829 Hs.191597 

135337 AA905406 Hs.9905 

135417 X55019 Hs.99975 

101367 X03350 Hs.4 
128870 H39537 Hs75309 

45 129381 AW245805 Hs.110903 
130085 M62402 Hs.274313 
130689 NM_006691 Hs.17917 
133120 NM_003278 Hs.65424 
133407 AF017987 Hs7306 

50 133731 N71725 Hs.272572 
134369 AF207664 Hs.8230 
135066 X04430 Hs.93913 
135173 AL036557 Hs.95910 
322580 AK001852 Hs.274151 

55 408790 AW580227 Hs.47860 
418043 AW377752 Hs.83341 
427458 BE208364 Hs.29283 
446674 AA563892 Hs.306000 
449826 U85642 Hs.138506 

60 RC_H15814_s 
YEL024w/RIP1 



Homo sapiens cONA: FU23227 lis, clone C 
Homo sapiens cDNA: FU22547 fis, done H 
ESTs, Weakly similar to FXD2 JHUMAN FORKH 
kaliikrein 10 (KLK10) (PRSSL1) (nes1) 
gbzq80hG9.s1 Stratagene hNT neuron (937 
secreted frfzzied-related protein 1 
BBP-like protein 1 
glutathione S-transferase M5 
apotipoprotein O 
hemoglobin, alpha 2 
hypothetical protein FLJ20608 
caictneurin binding protein 1 
ES1 (zebrafish) protein, human homolog o 
TU3A protein 

Homo sapiens mRNA; cDNA DKFZp566E183 (fr 
KIAA0652 gene product 
ketohexokinase (fructokinase) 
acyl-Coenzyme A dehydrogenase, very long 
a disintegrirvtike and metailoprotease ( 
famlnln, alpha 3 (nicein (150kD), kalini 
ESTs 

gb:Homo sapiens retinoic acid-inducible 
mitogen-activated protein kinase kinase 
CD8 antigen, alpha polypeptide (p32) 
exportin, tRNA (nuclear export receptor 
killer cell immunoglobulin-like receptor 
dynein, axonemal, light polypeptide 4 
POU domain, class 1, transcription facto 
cholesteryl ester transfer protein, plas 
TEK tyrosine kinase, endothelial (venous 
ESTs 

wingless-type MMTV integration site fami 
hypothetical protein FU20093 
K1AA1444 protein 
Interieukin 6 (interferon, beta 2) 
putative lymphocyte 607G1 switch gene 
tissue inhibitor of metalloproteinase 4 
K1AA0363 protein 

small Inducible cytokine subfamily A (Cy 
ESTs 

ESTs, Weakly similar to unnamed protein 
cholinergic receptor, nicotinic, delta p 
alcohol dehydrogenase 1B (dass I), beta 
eukaryotic translation elongation factor 
daudin 5 (transmembrane protein deleted 
insulin-like growth factor binding prote 
extracellular Unk domain-containing 1 
tetranectin (pfasminogen-binding protein 
secreted fnzzled-related protein 1 
hemoglobin, alpha 2 
a disintegrin-lfce and metailoprotease ( 
interieukin 6 (interferon, beta 2) - 
putative lymphocyte G0/G1 switch gene 



neurotrophic tyrosine kinase, receptor, type 2 

AXL receptor tyrosine kinase 

ESTs, Weakly similar to LKHU proteoglycan link 

solute carrier family 4 (anion exchanger), memb 

ESTs 

Human apM1 mRNA for GS3109 (novel adipose 
EST - YEL024w/RIP1 
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Table 1 A shows the accession numbers for those pkeys lacking unigenelD's for Table 1. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 
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Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



20 



Pkey CAT Number Accessions 

108446 1 12224.1 AA085383 M126091 M074174 AA075373 AA079120 AA070831 M075978 AA075372 AA128503 
108497 110079_2 AA074897M1 13914 AA0S4871 AA079329 M071309 AAO84710M129O3OAAO75O42AA074794AA071453AA0788O3 
M148628 AA122204 M074159 AA126185 AA0791 17 AA1 27089 AA070912 AA079280 AA131372 AA078833 AA071087 
AA076131 AA071047 AA079401 AA083070 AA102076 AA1 15163 AA074198 AA134725 AA1 13889 AA121 103 AA075041 
AA085148 AAO71310 M101 144 AA079659 M078931 AA079209 AA070928 AA068994 AA069817 AA076187 AA069053 
AA131489 AA071308 AA063317 M070156 M071430 AA076056 AA075684 AA070053 AA126283 AA126078 AA075895 
25 AA079208 M074583 AA071086 M079623 M070627 AA078802 M076622 AA065051 AA079143AA071110AA079434 

AA148748 MO79230 AA085188 AA074485 AA070580 AA076151 AA083166 AA0851 18 AA079450 M085044 AA120938 
AA079200 AA100188 AA081472 M122355 AA129031 AA085362 AA069220 AA070940 AA075968 AA074563 AA084027 
AA1 15929 

124215 1597154.1 H62570H59063 
117058 1219924.1 AW801806 H90434 BE086530 
1 10455 46874,1 H52576 AF085971 H52172 

111168 38585 1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861651 AW858362 AA232351 AA218567 
M055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA1 73345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 
AA157718 AA157719 M100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 
H66215M045564 AI694265 H60808AA149726AW1 95620 BE081333 BE073424 AW817662AW817705 AW817703 
AW817659 BE081531 H59570 
111498 411008.1 AI168511 A1022712AA700366 R07371 R07324 
104340 46289J0 AA4261 89 F 15201 
103747 117944 1 AA081995 AA101099 

134496 46501J M64936 A1025512 AI382987 BE061777 AA089966 BE169930 T41 176 AW594624 BE502415 AA121893 AI269283 T4031 1 
>AA2570il 

A1079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 
118365 1 AA126129AA126033AA082561 

34624 1 AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627 
AW869639 BE061833 BE000620 AW961 170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 
C16859 

AL042725 BE063316 AW975610 AA457591 BE062092 AI655202 AA714296 AI267264 AI075321 AA223286 AA071 122 
AA227649 AA216700 AJ696002 AA101867 AA099426 AA135997 AL041698 T0281 5 T51824 AA207189 T59230 T51868 
AA663341 BE165757 AW818104 AW392886 AA584918 AA099408 AW856396 AW861859 AA053045 
111686.1 AA081507 M070071 AA070840 AA084362 
322947.1 AI458623 AA639708AA485409 R22065AA485570 
113549.1 AA075124AA075208 
22955 11 M55405AW752552 

tigr HT2969 A03758 A06977 A15293 D17029 D17107 D17171 UJ0132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 

X51363 X51364 X51365 
tigr K73413 L27065 

6735 9 U22961 AA203623 AA503337 AI174733 AI192802 C06O92 AA035357 AI190619 AI1 99244 A1828450 AA602296 AI378195 
AI209170 A1166653 A1127795 AI183846 H77389 AJ589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 
AI685092 AI093426 A1623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 
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A1684569AA257011 



103750 
105239 



120379 34624J 



114624 
106851 
108392 
100545 
100654 

100702 
102208 
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N70806 AI141254 A1383191 AM01237 A1080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378MB W559265 
Sra S M625 46 R68379 AI133207 AI132980 AI133214 A1064826 AI061615 AI133473AI174652 A 133404 
SJk^W^MI^MI^ ABWSZB AI1331»AB6«02 AI174993 A11M72B ADei645 AMMW16 ABMBM W73BB 
TB^AfS A 110799 D17107 NM 000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710 

S»S2 74753 A1114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
^mMmxmmS^W3J55iOS R50261 T72061 N80533 T51 189 T74936 AI207490 A1132925 A1064701 
AI17#48WlS 

AI4446aSD16931 T40012 T48403 T56926 T69195 AI133061 T50850 AI400677 AKJ91 136 AA334608 T5741 it Z20979 
10 ^OTTe^E^ 

1U A^2t4t51362AI114589 R06691 AI1 10629 AF063503 AI140543 AA334661 AA332720 AA343262T73513 T86549 

AiSt57^T39981 5 rei407 'T72757 T51749 T56630 AA343125 T72126 R94135T83028 T39972T39896 AI174786 
U 32926 R09Z37 AI064838 I AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI1 14792 H9391 1 AI133106 

1 S H7108oSAA3«950AI174743AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 

15 S^f3993m9^T40136A)S 

M^016T60361T69176T73356 T58795 T61^^ 

Sa^1819T81684TOT69 Am4710T51776 AA343213 AI114714 T58102 AI110809 R28984 AI174854 AA305675 
OA H54053 T73211AJ1 14590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 

^^l^wSmaumW^mzmm AH33325 AI032512 AA701169 AI936354 AIII472O AW33289 

mSaA719iS 
25 aS 9 TK6^ 

10 $52657 T71984 789118 W92684 A1114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158 

TC1821^ 

T59166 AI1871 11 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 

30 T677 51 W174983 T51679TM851H69880 N737MA^^^ 

T28321 T55864 

123941 genbankJW621529 AA621529 

118049 aenbank_N53145 N53145 

102800 14782_20 AA313538 U88895 U88902 

35 104106 AA422123J_atAA422123j 

111738 genbanK_R26065 R26065 

113149 genbank_T515B8 T51588 

113958 genbank_W86195 W86195 

108335 genbank_AA070500 AA070500 

40 108351 genbank_AA071193 AA071193 

108441 genbank_M079079 AA079079 

124276 genbank_H83465 H83465 

101447 entrezLM21305 M21305 

117226 genbank_N20468 N20468 

45 133379 genbank_AA207059 AA207059AA207241 

119366 genbank_T77892 T77892 

! 119528 NOT_FOUND_entrez_W38051 W38051 

112588 genbank_R77302 R77302 

114449 genbank_AA020736 AA020736 

50 114576 genbank_AA065096 M065096 

107459 W38002.sjt W38002.S 

130339 genbankJ\A435746 AA435746 
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TABLE 2: Figure 2 from BRCA 001 US 
5 Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 
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Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelO: Unkjene number 

Unlgene Tide: Unigene gene title 

R1: Ratio of normal breast tissue to tumor 



Pkey ExAccn 


UnigenelD 


Unigone Tittle 


R1 


100499 T51966 


Hs.283108 


hemoglobin, gamma G 


10 


100549 BE142019 


Hs.222056 


Homo sapiens cONA FU11572 fis, clone HE 


10 


100654 A03758 




NM_000477*:Homo sapiens albumin (ALB), m 


10 


100971 BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


10 


101184 NM.001674 


Hs.460 


activating transcription factor 3 


10 


101336 NMJ0O6732 


Hs.75678 


FBJ murine osteosarcoma viral oncogene h 


10 


101367 X03350 


Hs.4 


alcohol dehydrogenase 1B (dass 1), beta 


10 


101447 M21305 




gb:Human alpha satellite and satellite 3 


10 


101461 N98569 


Hs.76422 


phosphoGpase A2, group HA (platelets, 


10 


101511 M27826 


Hs.267319 


endogenous retroviral protease 


10 


101736 M74447 


Hs.502 


transporter 2, ATP -binding cassette, sub-famil 


10 


102208 U22961 




gb:Human mRNA clone with similarity to L 


10 


102450 U48251 


Hs.75871 


protein kinase C binding protein 1 


10 


102800 AA313538 




gb:EST185419 Colon carcinoma (HCC) cell 


10 


102857 NMJHJ6744 


Hs.76461 


retinol-bmding protein 4, interstitial 


10 


102990 AA829286 


Hs.332053 


serum amyloid A1 


10 


103747 AA081995 




gbzn26d06j1 Stratagene neuroepithelium 


10 


103812 AA137107 


Hs.326391 


Homo sapiens, done MGC: 16638, mRNA, com 


10 


104093 R50727 


Hs.336970 


ESTs 


10 


104109 AL353957 


Hs.284181 


hypothetical protein DKFZp434P0531 


10 


104250 F06638 


Hs.12440 


Homo sapiens clone 24734 mRNA sequence 


10 


104492 N73185 


Hs.94285 


EST 


10 


104506 N91071 


Hs.109650 


ESTs 


10 


104532 AI498763 


Hs.203013 


hypothetical protein FU12748 


10 


104677 AA009764 


Hs.190380 


ESTs 


10 


104711 AA017245 


Hs.32794 


ESTs 


10 


104731 AA019300 


Hs.125070 


ESTs, Moderately similar to I54374 gene 


10 


105005 AI298208 


Hs.28805 


ESTs 


10 


105036 AA130390 


Hs.25549 


hypothetical protein FU20898 


10 


105239 AA221036 




gbzr03f12.r1 Stratagene NT2 neuronal pr 


10 


106052 N79885 


Hs.6382 


ESTs. Highly similar to T00391 hypotfieti 


10 


106181 A1803651 


Hs.191608 


ESTs 


10 


106283 A1085846 


Hs.25522 


KIAA1808 protein 


10 


106379 AL042069 


Hs.1 19021 


DKFZP434N061 protein 


10 


106451 AW235928 


Hs.313182 


ESTs 


10 


106491 AA135688 


Hs.10083 


Homo sapiens, clone IMAGE:4139786, mRNA, 


10 


106782 AW054886 


Hs.25682 


Homo sapiens mRNA for KIAA1863 protein, 


10 


107124 AB006532 


Hs.31442 


RecQ protein-like 4 


10 


107148 A1005036 


Hs.334305 


GS1999fuD 


10 


107214 AF127026 


Hs.5394 


myosin IA 


10 


107242 AB020672 


Hs.175411 


KIAA0865 protein 


10 


107331 AI905985 


Hs.111805 


ESTs 


10 


107447 W28516 


Hs.19210 


hypothetical protein MGC1 1308 


10 


107451 AL042425 


Hs,283976 


hypthelrcal proteai PR02389 


10 


107872 BE271708 


Hs.95110 


ESTs, Weakly similar to A55943 1-p hospha 


10 


108351 AA071193 




gbzf79b1ls1 SoaresjJbieal_glandJi3HPG 


10 


109546 F01449 


Hs.26954 


Homo sapiens mRNA; cDNA DKFZp762G123 (fr 


10 


110433 AW294162 


Hs.301062 


llDP^^cety^alpha-D-galactQsamine:polyp 


10 


110976 AL044174 


Hs.159526 


patched (Drosophila) homolog 


5 
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111168 AI798376 
111651 R16733 
111803 AA593731 
114484 AA034378 
125284 NNL0Q2666 
128850 AA193106 
128903 AW150717 
129346 AF110141 
129381 AW245805 
129516 AF020038 
129554 BE222078 
130085 M62402 
130243 D88435 
130400 V00517 
130436 NM.001928 
130563 BE270472 
130589 AL110226 
130683 AA993269 
130689 NMJ506691 
130689 AA046747 
130718 N70196 
130798 M81349 
130840 BE048821 
131184 AB040935 
131282 X03350 
131328 AW939251 
131543 AW966881 
131753 M829286 
131785 H69342 
131828 AJ000263 
132426 AW1 18072 
132675 A1291496 
132898 VV28548 
132905 NM-004235 
133120 NMJJ03278 
133407 AF017987 
133719 H26904 
134007 AF072441 
134055 D86062 
134111 A1372588 
134117 M081846 
134177 BE243319 
134369 AF207664 
134496 M64936 
134510 NWL002757 
134550 M26315 
134758 NMJJ00078 
134963 NfwL003394 
135066 X04430 
408790 AW580227 
446674 AA563892 



Hs.20499 

Hs.325823 

Hs.267319 

Hs.103253 

Hs.180817 

Hs.296176 

Hs.288908 

Hs.110903 

Hs.11223 

Hs.113069 

Hs.274313 

Hs.153227 

Hs.283108 

Hs.155597 

Hs.279900 

Hs.16441 

Hs.17872 

Hs.17917 

Hs.17917 

Hs.18376 

Hs.1955 

Hs.20144 

Hs.23954 

Hs.4 

Hs.25647 

Hs.41639 

Hs.332053 

Hs.26320 

Hs.278658 

Hs.89981 

Hs.5476 

Hs.224823 

Hs.7934 

Hs.65424 

Hs.7306 

Hs.75736 

Hs.7840 

Hs.182423 

Hs.8022 

Hs.7921 

Hs.79672 

Hs.8230 

Hs.250870 

Hs.85258 

Hs.89538 

Hs.91985 

Hs.93913 

Hs.47860 

Hs.306000 



g b:tr34b07.x1 NCLCGAP_Ov23 Homo sapiens 
ESTs 

ESTs, Moderately similar to ALU5_HUMAN A 

endogenous retroviral protease 

perilipin 

chromosome 1 1 open reading frame 23 
STAT induced STAT inhibitors 
WAS protein family. member2 
daudin 5 (transmembrane protein deleted 
Isocitrate dehydrogenase 1 (NADP+), soiu 
ESTs 

insulin-like growth factor binding prate 

cydin G associated kinase 

hemoglobin, gamma 6 

D component of complement (adipsin) 

HSPC015 protein 

DKFZP434H204 protein 

Homo sapiens, clone 1MAGE:3875012, mRNA 

extracellular link domain-containing 1 

extracellular link domain-containing 1 

K1AA1319 protein 

serum amyloid A4, constitutive 

small inducible cytokine subfamily A (Cy 

cerebral cell adhesion molecule 

alcohol dehydrogenase 1B (class I), beta 

v-fos FBJ murine osteosarcoma viral onco 

programmed cell death 2 

serum amyloid A1 

TRABID protein 

keratin, hair, basic, 6 (monilethrix) 
diacylglycerol kinase, zeta (104kD) 
Homo sapiens, clone IMAGE:3530123, mRNA. 
ESTs 

KruppeHike factor 4 (gut) 

tetranectin (plasminogen-binding protein 

secreted frizzied-related protein 1 

apolipoprotein D 

calcineurin binding protein 1 

ES1 (zebrafish) protein, human homolog o 

TU3A protein 

Homo sapiens mRNA; cDNA DKFZp566E183 (fr 
KIAA0652 gene product 
a dislntegrin-like and metalloprotease ( 
gb'JHomo sapiens retinoic acid-inducible 



CD8 antigen, alpha polypeptide (p32) 
cholesleryl ester transfer protein, plas 
wingtess-type MMTV integration site fami 
Interteukin 6 (interferon, beta 2) 
neurotrophic tyrosine kinase, receptor, type 2 
solute carrier family 4 (anion exchanger), memb 
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Table 2A shows the accession numbers for those pkeys lacking unigenelD's for Table 2. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

111168 38585J AI798376 S46400AW811 617 AW81 1616 W0Q557 BE 142245 AW858232 AW861 851 AW858362 AA232351 AA2 18567 



AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
AA157719AA100472M100774 AA130756 AA157705AA157730AA157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 



103747 117944 J AA081995 AA101099 

134496 46501 1 M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311 

AI684569 AA25701 1 A1079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 
105239 34624J AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 M668403 M601627 



AW869639 BE061833 BE000620 AW961 170 AW847519 AA308542 AW821633 AW94568B C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA1 13928 AA307904 
C16859 



100654 tigr KT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13Q75 M13076 M92816 U22961 V00494 V00495 
X51363 X51364 X51355 

102208 6735 9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 A1378195 



AI209170 AH86653 AI127795 A1183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 A1745413 
AI685092 AJ093426 AJ623873 AI074570 N50096 M047486 N25060 AA327614 AI042512 A1383957 AA156873 AI333101 
N70806 A1141254 AI383191 AM01237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 AA625146 R68379 AJ133207 AI132980 AI133214 AI064826 AI061615 AH 33473 AJ174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 A1064802 AI174993 AH 14729 AI061645 AI064716 AI064959 H77388 
T85706 AF075298 AH 10799 017107 NMJJQ0477 AF190168 R50724 AI248416 AI207432 AI133684 Ai133345 AI174710 
AI133290 AI133304 AI174948 AI207484 AH10717 AF074624 A1114515 AF063516 AI110642 A1114559 A1114498 A1114759 
AI207568 AI064960 AI174753 AI114666 R69184 R0001 1 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
H94399 A1133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51 189 T74936 AI207490 AH 32925 AJ064701 
AI174748 A1114663 AI133104 A1132999 AJ133100 AI064925 AJ064979 AI133063 AA343347 T69091 AA233989 T39772 
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AJ133061 T50850 A1400677 AI091 136 AA334608 T5741 1 Z20979 
N56507 T87485AI133622AA343370 T40075T69671 T53849T74820AF075316 AI1 10818 T401 21 T57381 AI114468 
AA332728 T51362 A11 14589 R06691 A11 10629 AF083503 AH40543 AA334661 AA332720 AA343262 T73513 T86549 
AM 14840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 A1174786 
AI132926 R09237 A1064838 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI114792 H93911 AJ133106 
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI1 10730 AF074637 AI207567 
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AU14676 ATO64778 AA035710 W52763 AI114786 
T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AM 14714 T58102 AI110809 R28984 AI174854 AA305675 
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 
H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 A1133117 AI766232 AI913646 
T83962 AI0651 12 AI207689 A1174684 AI207702 T81475 AI133325 AI032512 AA701 169 AI936354 AI114720 AI433289 
AA046980 A1823482 AH 14536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 A1823481 
AA845518 AA719124 AA883454 T68850 T69115 AI935509 AJ150977 T62890 T71374 T68294 AI174774 T67411 T68318 
A1064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AC05880 T62895 T69430 T95111 AA025050 T73330 
W52657 T71984 T691 18 W92684 AH 14860 T62093 T61797 AI522333 T73322 H92981 T56018 T6181 1 T57232 A1336158 
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T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T7131 1 T61819 T89358 T67708 T70918 
T59166 AI1871 1 1 T64308 T62071 T69427 A1114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 T64300 
T28321 T55884 

102800 14782 20 AA313538 U88895 U88902 

108351 genbankJ\A071193 AA071193 

101447 entre*_M213Q5 M21305 
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TABLE 3: Figure 3 from BRCA 001 US 
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of normal breast tissue to tumor 



15 

Pkey ExAccn UnigenelD UmgeneTitie R1 

101336 NMJW6732 Hs.75678 FBJ murine osteosarcoma vtral oncogene h 10.0 

102208 U22961 gb:Hurnan mRNA done with similarity to L 10.0 

20 102990 AA829286 Hs.332053 serum amyloid A1 10.0 

1 1 1 168 AI798376 gb:tr34b07jc1 NCI_CGAP_Ov23 Homo sapiens 10.0 

111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10.0 

130085 M62402 Hs.274313 insulin-like growth factor binding prate 10.0 

130840 BE048821 Hs.20144 small Inducible cytokine subfamily A (Cy 10.0 

25 131543 AW966881 Hs.41639 programmed cell death 2 10.0 

133120 NNL003278 Hs.65424 tetranectin {plasminogen-binding protein 10.0 

134758 NMJJ00078 Hs.89538 cholesteryl ester transfer protein, plas 10.0 
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TABLE 3A 

Table 3A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



1 0 Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



15 Pkey CAT number Accessions 



111168 38585 1 AI798376 S46400 AW811617 AW811616 W00557BE142245AW858232AW861851 AW858362 AA232351 AA218567 
111168 38585.1 M055556 AW858231 AW857541 AW814172 H66214AW814398 AF1 34164 AA243093 ^|]J^J^(^^Sf£^??J_ 771 - 
M227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
90 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 

2U AA209204 BE156909 AA226824 A1829309 AW991957 N66951 AA527374 HS6215 

1027086735 9 U22961 AA503337 AI174733 AI192802 C06092 M035357 AI190619 A1199244 AI828450 AA602296 AI378195 

1022086735.9 SSSSnOTAi58^ 
25 a 093426 A 623873 A 074570 SaA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 N70806 AI141254 

s 

M12523AHn^AI133120AI0^ 

™ D17107 NIVL000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 A1174710 A1133290 AI133304 A1174948 

JKJ AI207484 All 10717 AF074624 A11 14515 AF063516 AI1 10642 AI114559 AI114498 AJ114759 AI207568 AI064960 AI174753 

T73413 T55909 R50261 T72061 N80533 T51189 T74936 A1207490 AI132925 AI064701 AI174748 AI114663 AI133104 AI132999 
AI133100 A1064925 A1064979 AI133063 M343347 T69091 AA233989 T39772 AI444620 T52290 DMBT T40012 T«JJ» 
3 5 T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T5741 1 Z20979 N56507 T87485 Al 1 33622 AA34337Q T40075 

J T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468 AA332728 T51362 AI114589 R06691 AI110629 

AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 AM 14840 T57284 T39981 T61407 ' T72757 
T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 AI132926 R09237 AI064838 A1133860 T60398 T88753 
TM93QT92126AW446Q2T6^ 

40 AI133040 AI133017 AI064857 A1110730 AF074637 AI207567 H71080 T73217 AA343950 AI174743 M334224 AA334281 

AIG65049 T84512 T55918 A1207595 T39951 M005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 A!1 14676 
AI064778 AA035710 W52763 AM 14786 T83564 AA341859 T81684 T55769 Al 114710 T51776 AA343213AM 14714 J581Q2 
AM 10809 R28984 A1174854 AA305675 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 ^48364 
45 AR7 5308 W86731T82851T48269 H54053T73™^ 

4D R02576 T95665 AI203974 AI189471 AA005147 AI478102 A1207662 AI192792 AI768421 AI064737 AW05 713 ^93^93 

AI1331 17 A1766232 AI913646 T83962 A1065112 A1207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701 169 AI936354 
A 114720 A1433289 AA046980 AI823482 AI1 14536 AA860651 AW242644 R07469 AW300438 ^'^^^ TMor/AH7/77>i^^ 
T78943 A1823481 AA845518 AA719124 AA883454 T68850 T69115 A1935509 AI150977 T6289C > T7137 £68294 AI174774 

< ft TR74H T68318 AI064689 T56624 T69010T68982 T68302 AI332829 T72908 AI064819 AI20588O T62895 T69430 T95111 

J U ' AA025050 T73330 W52657 T71984 T691 18 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T6181 1 

T57232 AI336158T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819T89358 
T67708 T70918 T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 
T69394 AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 

55 AI312890 T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 

T64300T28321T55864 
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TABLE 4: Figure 4 from BRCA 001 US 
5 Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

1 0 UnkjenelD: Unigene number 

Unigene Title: Unigene gene title 
R1 : Ratio of tumor to normal breast tissue 

15 Pkey ExAccn UnigenelD Unigene Title R1 

100113 NfvL001269Hs.84746 chromosome condensation 1 2.3 

100114 X02308 Hs.82962 thymidylate synthetase 2.9 
100131 012485 Hs.1 1951 ectonudeotkie pyrophosphatase/phosphodiesterase 1 1.9 

20 100146 BE185499 Hs.2471 K1M0020 gene product 1.9 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fascicUn l-Dke) (periostln) 7.5 

100154 H60720 Hs.81892 WAA01 01 gene product 9.2 

100163 W44671 Hs.124 gene predicted from cDNA with a complete coding sequence 1.6 

100220 AW015534 Hs.217493 annexlnA2 2.0 

25 100265 038521 Hs.112396 WAA0077 protein 1-5 

100271 BE160O81 Hs.256290 S100 calcium-binding protein A11 (cafgizzarin) 13.5 

100275 BE242802 Hs.154797 K1M0090 protein 5.1 

100323 D50920 Hs.23106 K1AA01 30 gene product 19 

100335 AW247529 Hs.6793 platelet-activating factor acetylhydrolase, teoform lb, gamma subunit (29kD) 2.7 

30 100364 NM.004341 Hs.1 54868 carbamoyl-phosphate synthe tase 2, aspartate transcarbamylase, and dihydroorotase 2.0 

100372 NM_014791Hs.184339 KIAA0 175 gene product 2.6 

100393 D84145 Hs.39913 novel RGD-containing protein 3.2 

100400 AW954324 Hs.75790 phosphatidyBnositol glycan, class C 1.5 

100418 D86978 Hs.84790 KIAA0225 protein 2.0 

35 100482 M65028 Hs.81361 heterogeneous nuclear rtbonucleoproteln A/B 2.9 

100518 NM_004415Hs.74316 desmoplakin (DPI, DPII) 1.9 

100666 L05424 Hs.1 69610 CD44 antigen (homing function and Indian blood group system) 5.7 

100667 L05424 Hs.169610 C044 antigen (homing function and Indian blood group system) 9.0 

100668 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 7.6 
40 100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 

100685 AA328229 Hs.184582 ribosomal protein L24 1.8 

100690 AA383256 Hs.1657 estrogen receptor 1 1.6 

100783 AF078847 Hs.191356 general transcription factor OH, polypeptide 2 (44kO subunit) 5.9 

100850 AA836472 Hs.297939 cathepsinB 1.7 

45 100892 BE245294 Hs.180769 S164 protein 1.7 

100945 AF002225 Hs.180686 ubiquitin protein Bgase E3A (human papilloma virus E6-assodated protein, Angelman syndrome) 1.5 

100969 AA157634 Hs.79172 solute carrier family 25 (mitochondrial carrier; adenine nucleotide transit 6.3 

100988 AK000405 Hs.76480 ubiquiMke 4 11.4 

100999 H38765 Hs.80706 diaphorase (NADHyNADPH) (cytochrome b-5 reductase) 1.6 

50 101031 J05070 Hs.151738 matrix metaDoproteinase 9 {gelatinase B, 92kD gelatinase, 92kO type IV collagenase) 82 

101045 J05614 gb:Human proliferating cell nuclear antigen (PCNA) gene, promoter region. 5.0 

101077 N99692 Hs.75227 Empirically selected from AFFX single probeset 2.6 

101093 L06419 Hs.75093 procoliagen-lysme, 2^oxoglutarate 5-dioxygenase (lysine hydroxylase, Ehlers-Oanlos syndrome type V1)1 .4 

101161 NMJ)06262Hs.37044 peripherin 16.9 

55 101186 AA020956 Hs.179881 core-binding factor, beta subunit 2.0 

101216 AA284166 Hs.84113 cycfm^ependent Wnase inhibito^^ 1.8 

101228 AA333387 Hs.82916 chaperonln containing TCP1, subunit 6A (zeta 1) 1.7 

101247 AA1 32666 Hs.78802 glycogen synthase kinase 3 beta 1.9 

101249 L18964 Hs.1904 protein kinase C. iota 15 

60 101332 J04088 Hs.1 56346 topoisomerase (ONA) II alpha (170kD) 5.2 

101332 J04088 Hs.156346 topoisomerase (ONA) II alpha (170kD) 3.4 

101352 AI494299 Hs.16297 COX17 (yeast) homolog, cytochrome c oxidase assembly protein 6.3 

101396 BE267931 Hs.78996 proliferating cell nuclear antigen 4.2 

101445 M21259 gbHuman Alu repeats in the region 5 to the small nuclear rib 1.9 

65 101470 NM_.000546Hs.1846 tumor protain p53 (U-Fraumenl syndrome) 1.6 
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101478 NNL002890HS.758 RASp21 protein activator (GTPase activating protein) 1 2.5 

101483 M24486 Hs.76768 procoflagen-proline, 2-oxoglutarate 4<lioxygenase (profine 4-hydroxylase), alpha polypeptide I 5.5 
101540 J04977 Hs.84981 X-ray repair complementing defective repair in Chinese hamster celts 5 (double-strand-break rejoining 2.1 

101573 AW248421 Hs.250758 proteasome (prosome, macropain) 26S subunit, ATPase, 3 1.6 

5 101580 NM_012151Hs.83363 coagulation factor Vllt-associated {intronic transcript) 5.7 

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G protein), beta polypeptide 2 i-o 

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 5.6 

101621 BE391804 Hs.62661 guanytate binding proton 1, interferon-inducible. 67kD J* 
101702 AW504089 Hs.179574 protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), alpha tsoform 

10 101734 M74099 Hs.1 47049 cut (Drosophila)-fike 1 (CCAAT displacement protein) 2.1 

101759 M80244 Hs.184601 solute carrier family 7 (cationic amino acid transporter, y+ system), member 5 5.0 

101767 M81057 Hs. 1 80884 carboxypeptidase B1 (tissue) J 4 ' 4 
101782 AA306495 Hs.1869 phosphoglucomutase 1 

101805 AW409747 Hs.75612 stress-lnduced-phosphoprotein 1 (Hsp70/Hsp90-organlzing protein) 

15 101806 AA586894 Hs.1 12408 S100 calciunvbinding protein A7 (psoriasin 1) ™ 

101810 NM 000318HS.180612 peroxisomal membrane protein 3 (35kD, Zellweger syndrome) ^ 

101879 AA1~76374 Hs.243886 nuclear autoantigenic sperm protein (histone-binding) i- b 

101911 AA441787 Hs.1 19689 glycoprotein hormones, alpha polypeptide *y 

101920 AF182645 Hs.8024 IK cytokine, down-regulator of HLA II -«vjm^t« o* 
20 101973 U41514 Hs.80120 UDP-N-acetyl*lpha4)-galadosam^ 

101983 AI904232 Hs.75323 prohibitin 8,4 



102009 BE245149 Hs.82643 protein tyrosine kinase 9 J-J 
102036 BE250127 Hs.82906 COC20 (cell division cycle 20. S. cerevisiae, homoiog) 

102083 T35901 Hs,75117 interleukin enhancer binding factor 2,45kD J* 

25 102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 4 

102107 BE258602 Hs.182366 heat shock protein 75 1A 

102123 NM 0Q1809HS.1594 centromere protein A (17kD) ™ 

102165 BE313280 Hs.159627 death associated protein 3 ™ 

102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regulatory subunit (50kD) « 

30 102217 AA829978 Hs.301613 JTV1 gene * J 

102220 U24389 Hs.65436 lysosomal *~ 

102234 AW163390 Hs.278554 heterochrornatin-ake protein 1 ]•» 

10226Q AL039104 Hs.1 59557 karyophenn alpha 2 (RAG cohort 1, importin alpha 1) «■« 

102302 AA306342 Hs.69171 protein kinase C-Bke 2 f-' 

35 102330 BE298063 Hs.77254 chromobox homoiog 1 (Drosophila HP1 beta) ™ 
102339 BE378432 Hs.95577 cydin-dependent kinase 4 

102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member B2 

102349 AU077055 Hs.289107 baculoviral IAP repeat-containing 2 ** 
102369 U39840 Hs.299867 hepatocyte nuctear factor 3, alpha 

40 102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 *f 

102391 AA296874 Hs.77494 deoxyguanosine kinase ^ 

102455 U48705 Hs.75562 discoidin domain receptor family, member 1 *J 

102465 NM 001 359 Hs.8 1548 2,4-dienoyl CoA reductase 1, mitochondrial ™ 

102488 U50939 Hs.61828 amyloid beta precursor protein-binding protein 1,59kD ™ 

45 102489 AL080116 Hs.74420 origin recognition complex, subunit 3 (yeast homoiog )-Iike ™ 

102494 AI188137 Hs.75193 COP9homotag £1 

102501 AF217197 Hs.74562 siah binding protein 1; FBP interacting repressor; pynmidine tract binding splicing u 

102522 BE250944 Hs.183556 solute carrier family 1 (neutral amino acid transporter), member 5 2.8 
102532 AF040253 Hs.70186 suppressor of Ty (S.cerevisiae) 5 homoiog 

50 102564 U59423 Hs.79067 MAD (mothers against decapentaplegic, Drosophila) homoiog 1 ^ 
102568 W81489 Hs.223025 RAB31, member RAS oncogene family 

102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatidate cytidylyltransferase) 1 ^ 

102581 AU077228 HsJ7256 enhancer of zeste (CtosophHa) homoiog 2 J* 

102582 U61232 Hs.32675 tubufin-speaTwchaperonee J-J 
55 102617 AW161453 Hs.198767 ODP9{(x™titutive 1° 

102618 A1037672 Hs.81071 extracellular matrix protein 1 

102627 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-like) J-J 
102663 NM_002270Hs.168075 karyophenn (importin) beta 2 

102676 BE262989 Hs.12045 putative protein f* 

60 102687 NM_007019Hs.93002 ubtquitin carrier protein E2-C J* 
102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, type II w 
102696 BE540274 Hs.239 forkhead box M1 4 J 

102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 ™ 

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cys-Cys), member 19 ^ 
65 102750 AB014460 Hs.66196 nth (E.coH endonuclease lli)-fike 1 J-J 

102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinase M 
102812 U90549 Hs.236774 high-mobility group (nonhistone chromosomal) protein 17-hke 3 
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102827 BE244568 Hs.6456 chaperonin containing TCP1, subunit 2 (beta) 5.6 

102831 AA262170 Hs.80917 adaptor-related protein complex 3, sigma 1 subunit 2.0 

102844 AV653790 Hs.324275 WW domain-containing protein 1 1.3 

102868 X02419 Hs.77274 plasminogen activator, urokinase 4.4 

5 102925 BE440142 Hs.2943 signal recognition particle 19kD 1.9 

102935 BE561850 Hs.80506 small nuclear ribonudeoproteln polypeptide A' 2.4 
102968 AU076611 Hs.154672 methyterietetrahydrofolate dehydrogenase (NAD+ dependent), metrienyitetrahydrofolate cydohydroIase2.7 

102983 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A) expressed in 3.1 

102985 U95742 H&2707 G1 to S phase transition 1 5.2 

10 103023 AW500470 Hs.1 17950 multifundforttl polypeptide similar to SAICAR synthetase and AIR carboxylase 1.6 

103038 M926960 Hs.334883 COC28 protein kinase 1 2.5 

103060 NMJX)5940Hs.155324 matrix metalloproteinase 11 (MMP11;stromelystn3) 4.5 

103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatosis 1) 3.1 

103089 D31152 Hs. 179729 collagen, type X, alpha 1 (Schmid metaphyseal chondrodysplasia) 2.4 

15 103177 BE244377 Hs.48876 famesyl-diphosphate famesyttransferase 1 3.5 

103178 AA205475 Hs.275865 ribosomal protein S18 9-9 

103179 NMJW1777Hs.82685 CD47 antigen (Rh-related antigen, Integrin-associaled signal transducer) 1.3 . 
103181 X69636 Hs.334731 Homo sapiens, clone IMAGE:3448306, mRNA, partial cds 2.0 
103185 NrvL006825Hs.74368 transmembrane protein (63kD), endoplasmic relicuiunVGolgi intemiea^te wmpartment 1.6 

20 103191 AA401039 Hs.2903 protein phosphatase 4 (formerly X), catalytic subunit 2.5 

103193 NMJ)04766Hs75724 coatomer protein complex, subunit beta 2 (beta prime) 2.2 

103194 NM_004939Hs.78580 DEAO/H (Asp-Giu-Ata-Asp/His) box polypeptide 1 6.3 
103206 X72755 Hs.77367 monokine Induced by gamma Interferon 8.8 
103223 BE275607 Hs.1708 chaperonin containing TCP1, subunit 3 (gamma) 3.0 

25 103232 X75962 Hs.129780 tumor necrosis factor receptor superfamily, member 4 18 

103238 AI369285 Hs.75189 death-associated protein 5.6 

103297 Nrvt.001545Hs.9078 Immature colon carcinoma transcript 1 1-9 

103330 AI803447 Hs.77496 small nuclear ribonudeoprotein polypeptide G 2.5 

103349 X89059 gbHsapiensmRNA for unknown protein expressed in macrophage 1.6 

30 103376 AL036166 Hs.323378 coated veside membrane protein 1-8 

103391 X94453 Hs.114366 pyrroTme-5-carboxylate synthetase (glutamate gamma-semialdehyde synthetase) 2.3 

103392 X94563 gb:Ksaplens dbi/acbp gene exon 1 & 2. 4.0 
103430 BE564090 Hs.20716 translocase of inner mitochondrial membrane 17 (yeast) homologA 1.3 
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 5.6 

35 103505 AL031224 Hs.33102 transcription factor AP-2 beta (activating enhancer- binding protein 2 beta) 5.1 

103547 A1376722 Hs.180062 proteasome (prosome, macropain) subunit, beta type, 8 (farge multifunctionaJ protease 7) 9.7 

103588 NMJ06218HS.85701 prKispholnositide-3-kinase, catalytic, alpha polypepfide 2.0 

103613 NM_000346Hs^316 SRY (sex determining region Y)iox 9 (campomelic dysplasia, autosomal sex-reversal) 1.3 

103621 BE379766 Hs.150675 polymerase (RNA) II (ONA directed) polypeptide K (7,0kD) 2.0 

40 103622 AA609685 Hs.278672 membrane component, chromosome 11, surface marker 1 2.3 

103727 AIB78883 Hs.296381 growth factor receptor-bound protein 2 1*3 

103749 AL135301 Hs.8768 hypothetical protein FU 10849 1.8 

103754 AI015709 Hs.172089 Homo sapiens mRNA; cONA DKFZp586l2022 (from clone DKFZp586i2022) 1.3 

103780 AA094752 Hs.169992 hypothefical 422 Kd protein 7.5 

45 103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) mRNA, partial cds 1.2 

103797 AA080912 gb:m04d03j1 Stratagene hNT neuron (937233) Homo sapiens cDNA done 5* similar 1.5 

103813 AI042582 Hs.181271 CGM20 protein 15 

103855 W02363 Hs.302267 hypothetical protein FU 10330 1.5 

103886 AK001278 Hs.105737 hypothetical protein FU 1041 6 similar to cc^fitotwe photornorphogenic protein 1 6.5 

50 104052 NM_002407Hs.97644 mammaglobin2 2.9 

104079 AA251242 Hs.103238 ESTs 1.4 

104174 AA478984 Hs.6451 PRO0659 protein 5.6 

104227 AB002343 Hs.98938 protocadherin alpha 9 1.6 

104275 AI751970 Hs.101067 GCN5 (general control of amlno-add synthesis, yeast, homolog)-fike 2 5.4 

55 104325 BE379766 Hs.150675 polymerase (RNA) U (DNA directed) polypeptide K (7 .OkD) 6.3 

104370 AA324597 Hs.21851 Homo sapiens cONA FU12900 fis, clone NT2RP2004321 1.6 

104423 R83113 Hs.1432 protein kinase C substrate 80K-H 5.2 

104482 AB037762 Hs.44268 myelin gene expression factor 2 1.2 

104532 AM98763 Hs.203013 hypothetical protein FU 12748 2.1 

60 104563 AL1 17403 Hs.306189 DKFZP434F 1735 protein 1.2 

104667 AI239923 Hs.30098 ESTs 1.3 

104757 AI694413 Hs.332649 olfactory receptor, family 2, subfamily I, member 6 2.3 

104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP (Rsapiensj 1.3 

104806 AB023175 Hs.22982 KIAA0958 protein 2.3 

65 104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 105 

104846 A1250789 Hs.32478 ESTs 5.6 

104854 AA041276 Hs. 154729 3^osphoinositide dependent protein Wnase-1 12.3 
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104867 AA278898 Hs.225979 hypothetical protein similar to small G proteins, especially RAP-2A 

104871 T78044 Hs.28893 Homo sapiens mRNA; cDNA DKFZp56402364 (from clone DKFZp56402364) 



2.0 
1.3 

104896 AW015318 Hs.23165 ESTs ^ 

104909 AW408164 Hs.249184 transcription factor 19 (SC1) ™ 
5 104916 AW958157 Hs.155489 NS1 -associated protein 1 

104919 AA026880 Hs.25252 prolactin receptor \l 

104930 AF043467 Hs.32893 neurexophilin 2 7? 

104973 NM_.015310Hs.6763 KIAA0942 protein ™ 

104974 Y12059 Hs.278675 bromodomairvcontaining 4 

10 104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeasty-Bel 

104975 AL136877 Hs,50758 SMC4 (structural maintenance of chromoso h( ^ r(icft(Vft<n -.j:,. 

104978 AI199268 Hs;19322 Homo sapiens, Similar© R1KEN cDNA 2010317E24 gene, donelMAGE:3502019, mRNA, partial cds 7.2 

104979 AA937934 Hs.321062 ESTs \\ 
104994 AI499930 Hs.334885 mitochondrial GTP binding protein *jj 

15 105009 BE379584 Hs.34789 doikJiyi-diprK>sprKX)Bg<)saccteride-protein grycosyitransferase ™ 
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 

105028 AI050715 Hs.2331 E2F transcription factor 5, p130-bindlng ^wammimrm* fifl 
105032 AA127818 gb.dl2a02.s1 Soaresjjregnantuterus.NbHPU Homo sapiens cDNA clone IMAGE.501674 3 6.8 

105039 AA907305 Hs.36475 ESTs TL 

20 105041 AB037716 Hs.26204 K1AA1 295 protein ff 

105045 BE242899 Hs.129951 speckle-type POZ protein *! 

105079 AA151342 Hs.12677 CGM47 protein Z~ 

105087 AA147884 Hs.9812 Homo sapiens cDNA FU14388 fis, done HEMBA1002716 ™ 

105088 H58589 Hs.35156 Homo sapiens cONA FU1 1027 fis, done PLACE10041 14 " 
25 105095 Z78407 Hs.27023 vesicie transport-related protein *•* 

105110 BE387350 Hs.33122 KIAA1 160 protein 

105126 AW975433 Hs.36288 ESTs " 

105127 AA045648 Hs.301957 nudix (nudeoside diphosphate linked moiety X)-type imotfS ( " 
105141 AA164687 Hs.177576 mannosyl (alpha-1 glycoprotein beta-l^-N-acetylglucosaminyltransferase, isoenzyme A 2.7 

30 105158 AW976357 Hs.234545 hypothetical protein NUF2R 1* 

105169 BE245294 Hs.180789 S164 protein A 

105186 AA191512 Hs.28005 Homo sapiens cDNA FU1 1309 fis, clone PLACE1010076 w 

105254 AA071276 Hs.19469 K1AA0859 protein l 'Z 

105281 AA263143 Hs.24596 RADSl-lnteracting protein , To 

35 105288 N99673 Hs.3585 ESTs, Weakly simiiarto AF126743 1 DNAJ domain-containing protein MCJ [H.sapiens] 1.9 

105302 AA700122 Hs.3355 sentrin-spedfic protease ? " 

105331 AW270037 Hs. 179507 KIAA0779 protein '° 

105359 NMJ>16O15Hs.8054 CGl-68 protein 7. 

105366 BE264645 Hs.282093 hypothetical protein FU21918 ™ 

40 105373 AW887701 Hs.32356 hypothetical protein aJ20628 " 

105374 BE242803 Hs.262823 hypothetical protein FLM 0326 • 

105387 AW592146 Hs.108636 membrane protein CH1 . 

105393 AF167570 Hs.256583 interleukin enhancer binding factor 3, 90kD ™ 
105399 BE386877 Hs.334811 Npw38-binding protein NpwBP 

45 105400 AF198620 Hs.65648 RNA binding motif protein 8A w ,,0 _ n 

105445 AA252395 gb:zsl2g10 s1 NCI.CGAP.GCB1 Homo sapiens cDNA done IMAGE:685026 3\ mRNA sequence. 5.0 

105507 BE268348 Hs.226318 CCR4-NOT transcription complex, subunit 7 ■* 

105529 AA1 13449 Hs.32471 hypothetical protein FU20364 

105530 AB023179 Hs.9059 WAA0962 protein ™ 
50 105547 AA262640 Hs.27445 unknown 7J 

105564 BE616694 Hs.288042 hypothetical protein FU 14299 ■£ 

105596 AA579535 Hs.18490 hypothetical protein FU2Q452 ™ 

105597 AF054284 Hs.334826 splicing factor 3b, subunit 1, 155kD *■* 
105608 AI808201 Hs.287863 hypothetical protein FU12475 

55 105610 AA280072 Hs.99872 fetal Alzheimer antigen 

105617 AK000892 Hs.4069 glucocorticoid modulatoiy element binding protein 1 
105620 AW302245 Hs.181390 casein kinase 1, gamma 2 

105658 AA985190 Hs.246875 hypothetical protein FU20059 ™ 

105697 AW499988 Hs.27801 zinc finger protein 278 ' 

60 05708 R26944 Hs.180777 Homo sapiens mRNA; cDNA DKFZp564M0264 (from done DKFZp564M0264) 1J 

105743 BE246502 Hs.9598 sema domain. immunoglobulin domain (lg), transmembrane domain (TM) and snort ^ 

105746 AW151952 Hs.46679 hypothetical protein FU20739 \\ 

105759 AI123118 Hs.15159 chemoWne-Bke factor, alternatively spliced ™ 

105771 A1267720 Hs.153221 synovial sarcoma, translocated to X chromosome J* 

65 105820 AA741336 Hs.1 52108 transcriptional unit N143 ff 

105828 AA478756 Hs.194477 E3ubiquitinligaseSMURF2 I-J 

105856 AI262106 Hs.12653 ESTs " 
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105858 AF151066 Hs.281428 hypothetical protein 2.9 

105875 AKQ01708 Hs.32271 hypothetical protein FU 1084 6 1.4 

105930 AF016371 Hs.9880 peptidyl prolyl isomerase H (cyclophifin H) 5.2 

106000 AW194426 Hs.20726 ESTs 1.7 

5 106011 AW081202 Hs.12284 Homo sapiens, done !MAGE:2989556, mRNA, partial cds 2.8 

106017 AA477956 Hs.26268 ESTs 1.4 

106073 AL157441 Hs.17834 downstream neighbor of SON 1.4 

106078 AA130158 Hs.19977 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.6 

106094 AA533491 Hs.23317 hypothetical protein FU1 4681 6.8 

10 106140 AB006624 Hs.14912 WAA0286 protein 1.6 

106271 AA251393 Hs.289052 Homo sapiens, Similar to RIKEN cONA 5430429M05 gene, done MGC: 131 55, mRNA, complete cds 10.8 

1062B8 AB037742 Hs.24336 KIAA1321 protein 1.3 

106300 Y10Q43 Hs.19114 r^twnobflity group (rwnhistone chromosomal) protein 4 3.6 

106333 AL043114 Hs.22410 ESTs, Weakly similar to A54849 collagen alpha 1(VII) chain precursor (H.sapiensJ 5.4 

15 106350 AK001404 Hs.194698 cyc!inB2 5.7 

106359 AW390282 Hs.31 1 30 transmembrane 7 superfamily member 2 6.3 

106381 AB04Q916 Hs.24106 WAA1483 protein 6.5 

108389 AW748420 Hs.6236 Homo sapiens cDNA: FU21487 fis. done COL05419 2.2 

108457 AF1 19256 Hs.27801 zinc finger protein 278 2.7 

20 106470 053078 Hs. 186 180 Homo sapiens cONA: FU23038 fis, clone LNG02039 2.3 

106531 AA454036 Hs.8832 ESTs 1.6 

106586 AA243837 Hs.57787 ESTs 1.6 

106589 AK000933 Hs.28661 Homo sapiens cDNA FU 10071 fis, done HEMBA1 001702 2.4 

106610 AA458882 Hs.79732 Gbullnl 7.9 

25 106624 NM_003595Hs56350 tyrosytprotein suifotransferase 2 7.7 

106650 AL049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (from clone DKFZp564O0122) 1.8 

106669 AV657117 Hs.184164 ESTs, Moderately similar to S65657 a!pha-1C-adrenergic receptor splice form 2 [Rsapiens] 1.3 

108713 BE614802 Hs.184352 hypothetical protein FU1 2549 4.5 

106717 AA600357 Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein 1.3 

30 106723 BE388094 Hs.21857 ESTs 1-6 

106795 AF174487 Hs.293753 Bd-2-related ovarian killer protein-like 5.7 

106829 AW959893 Hs.27099 hypothetical protein FU23293 simOar to ARL-6 Interacting protein-2 16.2 

106831 BE564871 Hs^9463 centrin, EF-hand protein, 3 (CDC31 yeast homolog) 1.5 

106846 AB037744 Hs.34892 KIAA1 323 protein 2.2 

35 106852 AF151031 Hs.300631 hypothetical protein 1.3 

106873 N49809 Hs.11197 Homo sapiens, done IMAGE3343149, mRNA, partial cds 16.8 

105886 W79171 Hs.9567 GL002 protein 15 

105906 AA861271 Hs.222024 transcription factor BMAL2 22 

106920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase 3.3 

40 106945 AK000511 Hs.6294 hypoflietical protein DKFZp434L1 435 simflar to valyl tRNA synthetase 6.8 

106973 BE156256 Hs.11923 hypothetical protein 6.6 

106977 AL043152 Hs.50421 KIAA0203 gene product 4.8 

106978 AW631480 Hs.8688 ESTs 6.0 
107004 AA146872 Hs.300700 hypothetical protein FU20727 1.3 

45 107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 1.8 

107071 AW385224 Hs.35198 ectonudeotide pyrorjhcspbatase/^^ 1.7 

107113 AK000733 Hs.23900 GTPase activating protein 2.5 

107125 AK000512 Hs.69388 hypothetical protein FU20505 1.7 

107136 AV661958 Hs.8207 GK001 protein 4.6 

50 107136 AV661958 Hs.8207 GK001 protein 3.3 

107146 AK001455 Hs.5198 Down syndrome critical region gene 2 2.0 

107151 AW378065 Hs.8687 ESTs 6.3 

107155 AW391927 Hs.7946 KIAA1 288 protein 33.5 

107174 BE122762 Hs.25338 ESTs 5.2 

55 107197 W15477 Hs.64639 glioma pathogenesis-reiated proteh 6.1 

107221 AW888411 Hs.81915 leukemia-assodated phosphoprotein p18 (stathmln) 17.4 

107243 BE219716 Hs.34727 ESTs, Moderately ssnilar to 138759 zinc finger/leucine zipper protein (H.sapiens] 7.4 

1 07248 AW263124 Hs.31511 1 nudear receptor co-repressorfHDAC3 complex subunit 1 .8 

107263 060341 Hs.21198 translocase of outer mitochondrial membrane 70 (yeast) homolog A 6.6 

60 107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2,5 

107298 N95657 Hs.6820 ESTs. Moderately similar to YOJ1.CAEEL HYPOTHETICAL 63,5 KD PROTEIN ZK353.1 IN 2.5 

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1 JCAEEL H 1.7 

107299 BE277457 Hs.30661 hypothetical protein MGC4606 32 
107316 T63174 Hs.193700 Homo sapiens mRNA; cONA DKFZp586l0324 (from clone OKFZp58610324) 2.0 

65 107354 NM_006299Hs^6448 zinc finger protein 193 5.0 

107392 AW299900 Hs.267632 TATA element modulatory factor 1 \2 

107481 AA307703 Hs.279766 kinesln family member 4A 1.6 
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107529 BE515065 Hs.296585 nucleolar protein (KKE/D repeat) *0 
K BS7M4 S Rltt^Mitt^^ 

107772 AA018587 Hs.303055 ESTs, Weakly similar to ALU 1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 2.1 

107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed-reclifier, subfamily S. member 3 ».« 

107901 L42612 Hs.335952 keratin 6B 7Z 

107901 L42612 Hs.335952 keratin 6B ™ 

107922 BE153855 Hs.61460 Ig superfamily receptor LN1R tt 

107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme 1 °-' 
108040 AL121031 Hs.159971 SWI/SNF related, matnx associated, actin dependent regulator of chromatin, subfarruly b, member 1 1.5 



108230 AA054224 Hs.59847 ESTs 
108274 AF129535 Hs.272027 F-box only protein 5 

108296 N31256 Hs.161623 ESTs 3 5 

108496 AA083069 Hs.339659 ESTs _„™ n \ a 

15 108607 BE300380 Hs.69476 Homo sapiens cDNAFLJ12758fis, clone NT2RP2001 328 *J 

108621 AA101809 Hs.182685 ESTs ' 
108634 AW022410 Hs.69507 ESTs 

108647 BE546947 Hs.44276 homeo box C10 ™ 

108695 AB029000 Hs.70823 KIAA1077 protein '* 

20 108717 AA122393 Hs.70811 hypothetical protein FU2G5 16 x '° 

108740 AI089575 Hs.9071 progesterone membrane binding protein • 

108828 AK001693 Hs.273344 DKFZP564O0463 protein !° 

108859 AL1 21500 Hs.178904 ESTs " 

108872 H06720 Hs.1 11680 endosulfine alpha 53 

25 108891 A1801235 Hs.48480 ESTs VL 

108894 AK001431 Hs.5105 hypothetical protein FU10569 • 

108955 AA149754 Hs.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds m 

108982 AA151708 Hs.171980 homeo box (expressed in ES cells) 1 • 

108987 AA152178 Hs.23467 hypothetical protein FU10633 - rt 

30 109002 AB028987 Hs.72134 KIAA 1064 protein ^ 

iSuS Si HS ' 72127 gSWslStra^ 5.3 

1C3068 AA164293 Hs.72545 ESTs 16 

109101 AW608930 Hs.52184 hypothetical protein FU2061 8 ' 

35 109112 AW419196 Hs.257924 hypothetical protein FU1 3782 *f 
109124 AKD00684 Hs.183887 hypotheOcal protein FU22104 

109139 AJ132592 Hs.59757 zinc finger protein 281 ' 

109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkinesm 6) " 
109198 BE566742 Hs.58169 highly expressed In cancer, rich in leucine heptad repeats 

40 109213 NM_016603Hs.82035 potential nuclear protein C50RF5; GAP-IIke protein *f 

109220 AW958181 Hs.189998 ESTs 53 

109233 AU077281 Hs.170285 nucleoporin 214kD (CAIN) • 

109270 N99673 Hs.3585 ESTs, Weakly similar to AF 126743 1 DNA J ^f^tainjngp^ 1.4 

109273 AA375752 Hs.82719 Homo sapiens mRNA; cONA DKFZp586F1 822 (from done DKFZp586F1 822) Z.9 

45 109313 AF153201 Hs.86276 C2H2 (Kruppel-type) zinc finger protein " 

109341 AA213506 Hs.115099 EST 15 
109391 AL096858 Hs.184245 KIAA0929 protein Msx2 interacting nuclear target (MINT) homolog 

109420 H83603 Hs.40408 homeo box C9 * 

109426 N30531 Hs.42215 protein phosphatase 1, regulatory subunit 6 *» 

50 109429 AI160029 Hs.61438 ESTs A 

109445 AA232103 Hs.189915 ESTs 37 

109450 AB032969 Hs.173042 KIAA1 143 protein tL 

109468 NfVL015310Hs.6763 KIAA0942 protein tt 

109478 AW074143 Hs,87134 ESTs ' 

55 109570 L40027 Hs.1 18890 glycogen synthase kinase 3 alpha " 

109662 F02614 Hs.27319 ESTs A 

109825 R71264 Hs.16798 ESTs * 

110039 H11938 Hs.21907 htstone acetyltransferase rt 

110056 AA503041 Hs.279009 matrix Gla protein rt 

60 110085 AA603840 Hs.29956 KIAA0460 protein £j 

"*29 S S29 IsTUeaklysintoto^ " 

65 110242 N41744 Hs.19978 CG140 protein I*; 

110259 H28428 Hs.32406 ESTs, Weakly simPar to I38022 hypothetical protein [H^apiens] 2^ 

110312 BE256986 Hs.11896 hypothefical protein FLJ 12089 2,1 
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110330 AI288666 Hs.16621 DKFZP434I1 16 protein 6.2 

110501 H55748 gb:yq94a01.s1 Scares fetal fiver spleen WLS Homo sapiens cDNA clone 1MAGE:203400 3' 6.1 

110504 H55915 Hs.210859 hypothetical protein FU1 101 6 6.1 

110525 H57330 Hs.37430 EST 6.3 

110568 AK001160 Hs.5999 hypothetical protein FU10298 1.3 

110699 T97586 Hs.18090 ESTs 1.8 

110705 AB007902 Hs.32168 K1AA0442 protein 1.6 

110742 AW190338 Hs.28029 hypothetical protein MGC11256 7.6 

110761 AL138077 Hs.16157 hypothetical protein FU 12707 2.5 

110762 BE044245 Hs.30011 hypothetical protein M6C2983 9.3 
110765 AK000322 Hs.18457 hypothetical protein FU2031 5 5.5 
110769 BE000831 Hs.23837 Homo sapiens cDNAFU11812fis, clone HEMBA1006364 2.1 
110799 A1089660 Hs.323401 dpy-30-like protein 1.5 
110805 T25829 Hs.24048 FK506 binding protein precursor 6.6 
110813 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 5.7 
110820 R33261 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal |H.sapiens] 3.4 
110840 N31598 Hs,12727 hypothetical protein FU21610 1.7 
110844 AJ740792 Hs.167531 methylcrotonoyl-Coenzyme A carboxylase 2 (beta) 1.7 
110854 8E612992 Hs.27931 hypothetical protein FU1 0607. similar to glucosamine-phosphate N^acetyltransferase 4.7 
110856 AA992380 gb:ot37g06.s1 SoaresJestisJMHT Homo sapiens cDNA clone 3' similar to contains element 2.3 
110885 BE384447 Hs.16034 hypothetical protein MGC1 31 86 3.5 
110897 AL1 17430 Hs.6880- DKFZP434D 156 protein 2.2 
110915 BE092285 Hs.29724 hypothetical protein FU1 31 87 2.6 1 
110918 H04360 Hs.24283 ESTs, Moderately similar to reduced expression in cancer [Rsapiens] 1.9 
110958 NMJW5864HS.24587 signal transduction protein (SH3 containing) 6.7 
110963 AK002180 Hs.11449 DKFZP56401 23 protein 2.0 
110981 AK001980 Hs.24284 ADP-ribosyttransferase (NAD+; poly(ADP^ibose) polymeraseHike 2 1.3 
110984 AW613287 Hs.80120 UDP4I^cetyl*lprtf-D^ak^ 1.8 
111125 N63823 Hs.269115 ESTs, Moderately similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H^apiens] 3.6 
111132 AB037807 Hs.83293 hypothetical protein 2.1 
111164 N46180 Hs.122489 Homo sapiens cDNAFU13289fjs,ctaneOVARC1001 170 ^3 
111172 R67419 Hs.21851 Homo sapiens cONAFU 12900 fis, done NT2RP2004321 3.7 
111174 AL050166 Hs.26295 Homo sapiens mRNA; cDNA DKFZp586D1 122 (from done DKFZp586D1 122) 7.5 
111179 AK000136 Hs.10760 asporin (LRR class 1) 7.1 
111184 AI815486 Hs.243901 Homo sapiens CONAFU20738 fis, done HEP08257 6.7 
111184 AI815486 Hs.243901 Homo sapiens cONA FU20738 fis, clone HE 3.3 
111189 N67603 Hs.272130 ESTs, Weakly similar to S65824 reverse transcriptase homolog [H-saplensJ 3.6 
111216 AW139408 Hs.152940 ESTs 1.5 
111221 AB037782 Hs.15119 KIAA 1361 protein 2.6 
111223 AA852773 Hs.334838 KIAA 1866 protein 4.6 
111239 N90956 Hs.17230 hypothetical protein FU22087 7.9 
111285 AA778711 Hs.4310 eukaryotic translation initiation factor 1A 6.9 
111299 AB033091 Hs.74313 K1AA1 265 protein 5.0 
111312 AI523913 Hs.34504 ESTs 3.8 
111318 T99755 Hs.334728 ESTs 1.2 
111337 AA837396 Hs.263925 US1 -interacting protein NUDE1, rat homolog 5.1 
111352 H58589 Hs.35156 Homo sapiens cDNA RJ11027 fis, done PLACE10Q4114 2.2 
111370 AI478658 Hs.94631 brefetdin A-inhibited guanine nucleotide-exchange protein 1 2.8 
111384 N94606 Hs.288969 HSCARG protein 2.2 
111389 AK000987 Hs.169111 oxidation resistance 1 2.1 

111391 NNL003896HS.225939 siaiyltrarefeiase9(CW^eu^^^ 5.1 

111392 W46342 Hs.325081 Homo sapiens, done IMAGE:3659680, mRNA, partial cds 8,4 
111452 R02354 Hs.15999 ESTs 2.7 
111486 AI051194 Hs.227978 EST 6.5 
111549 W90638 Hs.20321 ESTs, Moderately similar to ZRF1JHUMAN ZUOTIN RELATED FACTOR-1 (M-PHASE 1.4 
111585 R10720 Hs.20670 EST 1.6 
111627 R52656 Hs.21691 ESTs 1.6 
111870 AB037834 Hs.18685 Homo sapiens mRNA for KIAA1413 protein, partial cds 2.4 
111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA OKFZp564O016 (from done DKFZp564D016) 10.6 
111944 AW083791 Hs.21263 suppressor of potassium transport defect 3 6.6 
111987 NM.015310Hs.6763 KIAA0942 protein 5.1 
112134 R41823 Hs.7413 ESTs; calsyntenin-2 2.8 
112244 AB029000 Hs.70823 K1AA1077 protein 14.6 
112388 R46071 Hs.301 693 Homo sapiens, done 1MAGE:3638994, mRNA, partial cds 9.0 
112456 NM.016248HS.232076 A kinase (PRKA) anchor protein 11 1,4 
112464 AW007287 Hs.28538 Homo sapiens cONA:FU21086 fis, done CAS03272 1.4 
112506 AI742756 Hs.26079 ESTs 3.2 
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112513 R68425 Hs.13809 hypothetical protein FU1 0648 
112752 AK001635 Hs.14838 hypothetical protein FU10773 



30 



35 



40 



45 



50 



55 



60 



65 



Homo sapiens mRNA for FU00004 protein, partial cds 
EST 

K1AA1557 protein 
ESTs 

112966 Z44718 Hs.102548 glucocorticoid receptor DNA binding factor 1 
112978 AK000272 Hs.7099 hypothetica! protein FLJ20265 

ESTs, Moderately similar to 21 15357A TYKi protein lM.muscu!us] 
zinc finger protein 259 
ESTs 

Homo sapiens mRNA for KIAA1729 protein, partial cds 
Hs.270862 ESTs 

gb:yb51a03.s1 Stratagene fetal spleen (937205) Homo sapiens cONA done IMAGE:74668 3\ 

. .~ ■> * a i /nnmimu »/m!nn» «.r>\1 A nlnno I 1 mDMA corn lonrfl 



112884 AK000004 Hs.5013 
112923 T10258 Hs.5037 
5 112936 AW970826 Hs.6185 
112958 R61388 Hs.6724 
112966 Z44718 

112978 AK0002.. - 

112995 AA737033 Hs.7155 
10 112996 8E276112 Hs.7165 
113047 AI571940 Hs.7549 
113049 AW965190 Hs.7560 
113089 T40707 

113196 T57317 yU.)IIW WU«».a I OUBUJyoiro «u« aprewi i w. 

1 5 113248 T63857 gb:yc16e01.s1 Stratagene lung (937210) Homo sapiens cDNA done 3\ mRNA sequence 

113254 AK002180 Hs.11449 DKFZP56401 23 protein 

1 13277 AW971049 Hs.11774 protein (peptidyl-prolyl cis/trans isomerase) NiMA-interacting, 4 (parvufin) 

113429 AA688021 Hs.179808 ESTs 

113499 AI467908 Hs.8882 ESTs 
20 113547 H59588 Hs.15233 ESTs 

113554 AW503990 Hs.142442 HP1-BP74 

113647 AA813887 Hs.188173 Homo sapiens cDNA FU12187 fe. done MAWMA1000831 • 
1 13702 T97307 gb:ye53h05.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:121497 3 , 

113722 AV653556 Hs.184411 albumin 



2.0 
1.8 
6.6 
1.5 
3.2 
6.0 
6.4 
\2 
5.6 
2.0 
1.9 
2.4 
1.3 
1.7 
2.8 
1.3 
3.2 
\2 
5,9 
2.0 
3.6 
1.3 
4.4 
1.3 



113 Vu AVbWWti ns.itmii aiuumin „ „ . , ' A « 

25 113759 AW499665 Hs.9456 SW1/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 5 1.2 



113777 BE266947 Hs.10590 zinc finger protein 313 
113783 AL359588 Hs.7041 hypothetical protein DKFZp762B226 
113791 AJ269096 Hs.1 35578 chitobiase, di-N-acetyl- 
113808 W44735 Hs.9286 Homo sapiens cDNA: FU21278 lis. done COLQ1 832 
113811 BE2Q7480 Hs.6994 Homo sapiens cDNA: FU22044 fls, clone HEP09141 
113817 H13325 Hs.332795 hypothetical protein DKFZp761017121 
113826 AW378212 Hs.24809 hypothetical protein FU10826 
113834 T26483 Hs.6059 EGF-containing fibulin-iike extracellular matrix protein 2 

proteasome (prosome, macropain) 26S subunit, non-ATPase, 11 
hypothetical protein, similar to (U06944) PRAJA1 
ESTs 

hypothetical protein FU22041 similar to FK506 binding proteins 
Hs.268828 ESTs 

Hs.120969 Homo sapiens cDNA RJ11562 fe. done HEMBA 10031 97 



113868 W57902 Hs.90744 
113870 AL079314 Hs.16537 
113885 AW959486 Hs.21732 
113923 AW953484 Hs.3849 
113989 W87544 
114022 AI539519 



13.4 
1.7 

I. 3 
3.3 
3.1 
3.2 
2.3 

II. 3 
2.7 
6.1 
6.6 
1.9 
U 
5.4 
9.4 
1.8 
1.5 



1.4 

15.8 

1.9 

2.4 

1.8 

1.2 

5.5 

5.2 



1 AlWyOia nS.l/USO» noma sapiens tuiw tlj i i ju£. to, wuuc i «-»vil«-» .^w. 

1 14030 AI825386 Hs.164476 hypothetical protein FU21 939 simitar to 5-azacytidine induced gene 2 
114060 AB029551 Hs.7910 RING1 and YY1 binding protein 
114196 AF017445 Hs.150926 fucose-1 -phosphate guanytyltransferase 
114226 A8028968 Hs.7989 K1AA1045 protein \ A 
114253 BE149866 Hs.14831 Homo sapiens, Similar to zinc finger protein 136 (done pHZ-20), done MGC:10647, inRNA, complete ods 
114262 AL117518 Hs.3686 KIAA0978 protein {A 
114275 AW515443 Hs.306117 KIAA0306 protein 
114292 AI815395 Hs.184641 fatty acid desaturase 2 
114309 AA332453 Hs.20824 CGI-85 protein 

114392 AA249590 Hs.100748 ESTs, Weakly similar to A28996 proline-rich protein M14 precursor - mouse [Mjnusculus] 
114407 8E539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 (from done DKFZp434B0425) 
114455 H37908 Hs.271616 ESTs, Weakly similar to ALU8.HUMAN ALU SUBFAMILY SX SEQUENCE 

114463 AL1 20247 Hs.40109 KIAA0872 protein -- , 

114464 AI091713 Hs.106597 Homo sapiens, Similar to RIKEN cDNA 1110012M11 gene, done IMAGE:3688605, mRNA, partial cds 1.2 
114471 AAO28074 Hs.104613 RP42homolog -JA™ 
114480 BE066778 Hs.151678 UDP-N-ace^tpiia^ala^ 134 
114671 AA766268 Hs.266273 hypothetical protein FU13346 \& 
114698 AA476966 Hs.110857 polymerase (RNA) 111 (DNA directed) polypeptide K (12.3 kDa) 3.5 
114730 A1373544 Hs.331328 intermediate filament protein syncoilin 3.8 
114767 AI659865 Hs.154443 rninichromosome maintenance deficient (S. cerevlsiae)4 16 
114774 AV656017 Hs.184325 CGI-76 protein 3.1 
114798 AA159181 Hs.54900 serologically defined colon cancer antigen 1 3.5 

bromodomain and PHD finger containing, 3 4.3 
KIAA0887 protein 7.1 
hypothetical protein 1 - 3 
b:zt29f02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done !MAGE:723771 3', mRNA sequence. 

2.0 
2.9 



114860 AL157545 Hs.42179 

114895 AA236177 HsJ6591 

114896 BE539101 Hs.5324 
114911 AA236672 

114930 AA237022 Hs.188717 ESTs 

114938 AA242834 Hs.58384 ESTs 



2.3 



1.5 
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114965 A1733881 Hs.72472 BMP-R1B 2.3 

115023 AF102546 Hs.63931 dachshund (DrosophEa) homolog 13 

115038 AA252360 Hs.87968 toB-fike receptor 9 1.6 

115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 1913076 118 

5 115062 AA253314 Hs.154103 UM protein (similar to cat protein Wna 1.5 

115117 AI670847 Hs.5324 hypothetical protein 1.5 

115121 AI634549 Hs.88155 ESTs 2.8 

1152)6 AW183695 Hs.186572 ESTs 2.5 

115221 AW365434 Hs.79741 hypothetical protein HJ 101 16 1.5 

10 115239 BE251328 Hs.73291 hypothetical protein FU 10881 1.3 

115242 AI368236 Hs.283732 ESTs, Moderately similar to ALU1J1UMAN ALU SUBFAMILY J SEQUENCE 1.4 

115278 AK002163 Hs.301724 hypothetical protein FU 11 301 1.5 

115285 AW972872 Hs.293736 ESTs 2.4 

115291 BE545072 Hs.122579 hypothetical protein FU10461 6.2 

15 115400 A1215069 Hs.89113 ESTs 6.6 

115468 AA314349 Hs.48499 tumor antigen SLMp 7.4 

115471 AK001376 Hs.59346 hypothetical protein FU 105 14 1.4 

115479 AW301608 Hs.278188 ESTs. Moderately similar to 154374 gene NF2 protein [KsapiensJ 4.0 

115496 AW247593 Hs.71819 eukaryotic translation initiation factor 4E binding protein 1 16.3 

20 115500 Y14443 Hs.88219 zinc finger protein 200 5.0 

115553 AJ275986 Hs.71414 transcription factor (SMIF gene) 2.5 

115581 AI540842 Hs.61082 ESTs 6.1 

115587 BE081342 Hs.283037 HSPC039 protein 2.9 

115590 AA399477 Hs.67896 7-60 protein ' 5.3 

25 115646 N3611Q Hs.305971 solute carrier family 2 (facilitated glucose transporter), memberlO 4.7 

115652 BE093589 Hs.38178 hypothetical protein FU23468 10.6 

115655 AL048269 Hs.288544 Homo sapiens, done MGC:1 6063, mRNA, complete cds 12.7 

115663 AI138785 Hs.40507 ESTs 2.0 

115676 AA953006 Hs.88143 ESTs 3.0 

30 115690 AA625132 Hs.44159 hypothetical protein FU2161 5 1.7 

115693 AF231023 Hs.55173 cadherln, EGF LAG seven-pass G-type receptor 3, fbmingo (Drosophiia) homotog 6.8 

115715 BE395161 Hs.1390 proteasome (prosome, macropaln) subunit, beta type, 2 1.7 

115734 AI950339 Hs.40782 ESTs 2.6 

115811 NMJM5434HS.4B604 DKFZP434B1 68 protein 2.1 

35 115823 AI732742 Hs.87440 ESTs 2.1 

115837 AI675217 Hs.42761 ESTs 1.3 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 4.4 

115866 AW062629 Hs.52081 KIAA0867 protein 7.2 

115875 N55669 Hs.333823 mitochondrial ribosomal protein L13 1.2 

40 115941 AI867451 Hs.46679 hypothetical protein FU20739 5.5 

115968 AB037753 Hs.62767 WAA1332 protein 9.8 

116003 BE275469 Hs.66493 Down syndrome critical region gene 5 1.4 

116011 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005735 2.4 

116108 AA770688 Hs.28777 H2A histone family, member L 18 

45 116134 BE243834 Hs.50441 CGI-04 protein 1.4 

116189 N35719 Hs.44749 ESTs, Moderately similar to T00358 hypothetical protein KIAA0684 IH.sapiens] 1.2 

116195 AW821113 Hs.72402. ESTs 2.1 

116238 AV660717 Hs.47144 DKFZP586N081 9 protein 1.7 

116246 AF265555 Hs.250646 baculovtral IAP repeat-containing 6 1-7 

50 116262 A1936442 Hs.59838 hypothetical protein FU10808 1 7 

116298 AI955411 Hs.94109 Homo sapiens cDNA FU13634 lis, clone PLACE101 1 133 1.9 

116318 AF097645 Hs.58570 deleted in cancer 1; RNA heiicase HDB/DICE1 4.9 

116325 AI472106 Hs.49303 Homo sapiens cDNA RJ 11663 lis, clone HEMBA1004631 1.4 

116336 AL133033 Hs.4084 KIAA1025 protein 19 

55 116339 AK00O290 Hs.44033 dipeptidyl peptidase 8 1.5 

116350 AA497129 Hs.184771 nudear factor l/C (CCAAT-binding transcription factor) 1.9 

116358 AI149586 Hs.38125 interferon-induced protein 75. 52kD 19 

116365 N50174 Hs.46765 ESTs 6.1 

116368 N90466 Hs.71109 WAA1229 protein 1.6 

60 116417 AW499664 Hs.124S4 Human clone 23826 mRNA sequence 7.4 

116436 AA161411 Hs. 58668 chromosome 21 open reading frame 57 2.1 

116462 AF218313 Hs^36828 putative heiicase RUVBL 1.5 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 2.1 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 1.2 

65 116575 AA312572 Hs.6241 phosphoinosiflde-3-ldnase, regulatory subunit. polypeptide 1 (p85 alpha) 15 

116637 AK001O43 Hs.92033 integrin-linked kinase-assoclated serine/threonine phosphatase 2C 2.7 

116640 X89984 Hs.211563 B-celi CLUiymphoma 7A 2.3 
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116700 AI800202 Hs.317589 hypothetical protein MGC10765 J-* 
116705 AW074819 Hs.12313 hypolhetical protein FU14566 # 
1 16732 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypothetical protein [Rsapiens] 

116921 AW068115 Hs.821 biglycan J~ 

5 116926 H73608 Hs.290fi30 ESTs J' 

117034 U72209 Hs.180324 YY1 -associated factor 2 *J 

limr N2i03 6 2 6 HS ' 42315 SSS^^ 5 ' 5 

117276 N71183 Hs.121606 Homo sapiens cDNA FU1 1971 fis, clone HEMB81001208 iAlkM DDnTC|K| ]* 

10 117284 AK001701 Hs.183779 Homo sapiens cDNA FU 10590 fls, done NT2RP2004392, weakly similar to MNN4 PROTEIN 2.0 

117367 A1041793 Hs.425Q2 ESTs 

117368 AI878942 Hs.9Q336 ATPase, H+ transporting, lysosomal (vacuolar proton pump), member J £J 
117382 AF150275 Hs.40173 ESTs f' 
117412 N32536 Hs.42645 solute carrier family 16 (monocarboxytic acid transporters), member6 ™ 

15 117557 AF123050 Hs.44532 diubiquitin *J 

117588 N34895 Hs.44648 ESTs *? 

117745 BE294925 Hs.46680 CGl-12 protein *J 

117754 AA121673 Hs.59757 zinc finger protein 281 J* 

117879 N54706 Hs.303025 chromosome 11 open reading frame 24 » ° 

20 117881 AF161470 Hs.260622 butyrate-lnduced transcript 1 , J' 

117904 BE540675 Hs.332938 hypoftetical protein MGC5370 J? 

117911 AL137379 Hs.47125 hypothetical protein FU 1391 2 J-* 

117933 Y10518 Hs.1 16470 hypothelicai protein FU20048 y 

117983 AL110246 Hs.47367 K1AA1785 protein *J 

25 118078 N54321 Hs.47790 EST Jf 

118301 AA453902 Hs.293264 ESTs c '\ 

118429 AA243332 Hs.74649 cytochrome c oxidase subunit Vic J* 

118472 AL157545 Hs.42179 bromodomaln and PHD finger containing, 3 y 

118488 AJ277275 Hs.50102 rapa-2 (rapa gene) ^^.^ Je 

30 118509 N22617 Hs.4322B Homo sapiens cDNA FU11835 fis, clone HEMBA10Q6595 y 

118528 AI949952 Hs.49397 ESTs ££ 

118670 mS iSilslsiS ESTltAideraie^^ \\ 

118698 AB033113 Hs.50187 K1AA1 287 protein .MArc^ae ei 
3 5 1 18737 AA199686 gb:zq75g09.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone IMAGE:647488 ,5 5.2 
8925 N92293 Hs206832 ESTs Moderately similar to ALU8 HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.4 
ll89W S)9 Sm 

118986 AF148713 Hs.125830 bladder cancer overexpressed protein ™ 

119206 W24781 Hs.293798 K1AA171Q protein ™ 

40 119235 AW453069 Hs.3657 activity-dependent neuroprotective protein " 

119235 AW453069 Hs.3657 acfivity-dependent neuroprotective prote , ™ 

119265 BE539706 Hs.285363 ESTs 251 
119279 N57568 Hs.48028 EST 

119298 NM_001241Hs.155478 cyclinT2 !° 

45 119338 AI417240 Hs.320836 ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiensJ w 

119349 T65004 Hs.163561 ESTs °! 

119403 AL1 17554 Hs.1 19908 nucleolar protein NOP5/NOP58 01 
119478 AI624342 Hs.170042 ESTs 

119486 A1796730 Hs.55513 ESTs f» 

50 119513 W37933 Empirically selected from AFFX single probeset J* 

119601 AK000155 Hs.91684 Homo sapiens mRMA; cDNA DKFZp667l103 (from done DKFZp667l103) 3.7 

119602 AW675298 Hs.233694 hypothetical protein FU 11 350 ™ 
119676 AA243837 Hs.57787 ESTs !~ 
119682 W61019 Hs.57811 ESTs J* 

55 119774 AB032977 Hs.6298 KIAA1 151 protein « 
119780 NM_016625Hs.191381 hypolhetical protein 

119789 BE393948 Hs.50915 kallikrein 5 (KLK5; KLK-L2; stratum comeum tryptic enzyme) „^™ lf -^on„ A 
IMS AJ223810 Hu«3213 ESTs, Weakly similar to IEFSJHUMAN TRANSFORMATION-SENSITIVE PROTEIN IEF SSP 3.6 

119818 AA130970 Hs.58382 hypothetical protein FU 11 101 « 
60 119863 AA081218 Hs.58608 Homo sapiens cDNA FU 14206 fis, clone NT2RP30031 57 

119905 AW449064 Hs.1 19571 collagen, type 111, alpha 1 (Ehlers-Danlos syndrome type IV, autosomal dominant) 2.6 

119966 AA703129 Hs.58963 ESTs ^ 
120132 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA ™ 

65 120248 AI924294 Hs. 1 73259 uncharaclerized bone marrow protein BM033 v J- 2 
120253 AA131376 Hs.326401 fibroblast growth factor 12B 

120269 AW131940 Hs.104030 ESTs y ' b 
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120274 AA177051 gb:nc02a02.s1 NCI_CGAP_Pr3 Homo sapiens cONA done IMAGE:194 similar to contains Atu 4.6 

120280 AA190577 gb:zp52g02.s1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA done 3', mRNA sequence 2.0 

120296 AW99591 1 Hs.299883' hypofoetical protein FU23399 1.8 

120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H^apiens] 15.2 
5 120324 AA195517 Hs.191643 ESTs 5.5 

120325 AA195651 Hs.104106 ESTs 6.4 

120327 AK000292 Hs.278732 hypothetical protein FU20285 16.1 

120336 N85785 Hs.181 165 eukanyotic translation elongation factor 1 alpha 1 2.9 

120342 AW450669 Hs.45068 hypothetical protein DKFZp434l1 43 5.7 

10 120345 AA210722 Hs.104158 ESTs 4.5 

120349 AW969481 Hs.55189 hypothetical protein 16.8 

120352 R06859 Hs.193172 ESTs, Weakly similar to I38022 hypothetical protein (H.sapiens) 5.0 

120356 AF000545 Hs.296433 putative purinerglc receptor 28.1 

120371 AA219305 Hs.104196 EST 12.4 

15 120382 AA228026 Hs.38774 ESTs 4.0 

120383 AL109963 Hs.123122 FSH primary response (LRPR1, rat) homotog 1 9.7 

120386 AW969665 Hs.154848 hypothetical protein DKFZp434D01 27 32.6 

120388 AA232874 Hs.104245 ESTs 3.1 

120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7J4UMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21 .7 
20 120396 AA134006 Hs.79306 eukaryotic translation initiation factor 4E 12.5 

120404 AB023230 Hs.96427 K1AA1 01 3 protein 7.2 

120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA OKFZp586F1323 (from done DKFZp586F1 323) 11.4 

120423 AA236453 Hs. 18978 Homo sapiens cDNA: FU22822 fis, done KAIA3968 1.9 

120472 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapiens cDNA done 3\ mRNA sequence 19.4 

25 120473 AA251973 Hs.269988 ESTs 5.4 

120484 AA253170 Hs.96473 EST 10.4 
120504 AA256837 gb:zr84d10.s1 Soares_NhHMPu_S1 Homo sapiens cDNA done IMAGE:682387 3\ mRNA sequence. 3.9 

. 120509 BE047718 Hs.96545 ESTs 9,4 

120520 AA258601 Hs.161731 EST 2.4 

30 120535 BE350244 Hs.96547 ESTs 2.5 

120551 AA279160 Hs.111407 Homo sapiens, done IMAGE:3613029. mRNA, partial cds 5.2 

120570 AA280679 Hs.271445 ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4 

120582 BE244830 Hs.284228 ZNF135-fike protein 10.2 

120590 AW372799 Hs.125790 leudne-rich repeat-containing 2 2.1 

35 120596 AA282074 Hs.237323.N-acetylgluoosaminei3hosphatemutase 7.5 

120619 AW965339 Hs.111471 ESTs 2.5 

12)624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52.0 
120839 AA286942 gb:zs56f05.s1 NCLCGAP.GCBI Homo sapiens cDNA done IMAGE:701505 3* similar to contains Alu2.4 

120648 AA287095 Hs.140309 Homo sapiens, done IMAGE3677194, mRNA, partial cds 5.0 

40 120653 AW063659 Hs.191649 ESTs 22 

120668 AW969638 Hs.112318 6.2 kd protein 2.2 

120669 BE536739 Hs.109909 ESTs 1.9 

120695 AA976503 gb:oq30a04.s1 NCI CGAP_GC4 Homo sapiens cDNA done 3* similar to contains PTR7.H PTR7 46.8 

120696 AI821539 Hs.97249 ESTs 2.5 
45 120713 AW449855 Hs.96557 Homo sapiens cDNAFU12727fis, clone NT2RP2000027 5.9 

120718 AA292747 Hs.97296 ESTs 2.9 

120750 AI191410 Hs.96693 ESTs, Moderately similar to 2109260A B cell growth factor (H.sapiens] 7.0 

120774 AI608909 Hs.193985 ESTs 7.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 protein 6.8 
5 0 1 20809 AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing O family repeat, 4.4 

120938 AA386260 Hs.104632 EST 4.4 

120977 AA398155 Hs,97600 ESTs 4.4 

120984 BE262951 Hs.99052 ESTs 5.6 

120985 A1219896 Hs.97592 ESTs 1.2 
55 121011 AA398360 Hs.97608 EST 3.1 

121026 AM39713 Hs.165295 ESTs 3.5 

121081 AA398721 Hs, 186749 ESTs, Highly similar to 137550 mismatch repair protein MSH2 [H.sapiens] 5.4 

121133 AA363307 Hs.97032 ESTs 3.7 

121176 AL121523 Hs.97774 ESTs 1.7 

60 121223 AI0021 10 Hs,97169 ESTs, Weakly similar to dJ667H12.2.1 (H.sapiens] 2.9 

121320 AA403008 Hs,301927 c6.1A 1.9 

121340 AW956981 Hs.97910 Homo sapiens CDNAFU1 3383 fis, done PUCE 1001 024 3.5 

121408 AA406137 Hs.98019 EST 6.0 

121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-ceD growth factor precursor [Ksapiens] 7.4 

65 121450 AA406430 Hs.105362 Homo sapiens, done MGC: 18257, mRNA, complete cds 6.9 

121452 AW971063 Hs.292882 ESTs 1.8 

121455 H58306 Hs.15165 retinoic acid Induced 14 10.5 
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121457 W07404 Hs.144502 hypothetical protein RJ22055 3.4 

121496 M442224 Hs.97900 ESTs J" 

121505 M494172 Hs.194417 ESTs I*' 

121508 AA402515 Hs.97887 ESTs ^ 

^ 121514 AA412112 HS ' 18151 ° gfrzt69b02.s1 SoaresJestis_NHT Homo sapiens cDNA clone 1MAGE:727563 3\ mRNA sequence.^ ^2.6 

121558 AA412497 Hs ' 98142 gb^2t95g1 Zs1 SoaresJestis_NHT Homo sapiens cDNA clone 1MAGE:73Q150 3' similar to contains2.8 
121577 AA411970 Hs.98096 EST 

10 121581 M416568 gb:zu05c10.s1 SoaresJestisJiHT Homo sapiens cDNA done 3*. mRNAsequence 6.1 

121589 AD001528 Hs.89718 spemiine synthase ^ 

121594 AA626010 Hs.98247 ESTs ~ 

121622 AA416931 Hs.126065 ESTs mM M , D , Afn , i t 

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNADKFZp434B 1023 (from clone DKFZp434B 1023) 7.8 

15 121682 AA418160 Hs.86043 Homo sapiens cDNARJI 3558 fis, clone PLACE1 007743 j*> 

121690 AV660305 Hs.110286 ESTs \' 

121706 U55184 Hs.154145 hypothetical protein FLJ1 1585 J*;' 

121714 AA419225 Hs.98269 H<>mo sapiens cDNAFUl 1953 fis, done HEMBB1000883 ™ 

121729 AI949597 Hs.98325 ESTs J* 

20 121731 AA421041 Hs.180744 ESTs 
121744 AA398784 Hs.97514 ESTs 
121748 BE536911 Hs.234545 hypothetical protein NUF2R 

121773 AB033022 Hs.15B654 K1AA1 196 protein '* 
121775 AA421773 Hs.161008 ESTs 

25 121776 AA292579 Hs.125133 hypothetical protein FU22501 ™ 

121786 A1810774 Hs.98376 ESTs ™* 0 

121832 AW340797 Hs.98434 ESTs *! 
121836 AA328348 Hs.218289 ESTs . , 
121839 M425691 Hs.191606 ESTs, Highly similar to KIAA1048 protein |H.sapiens] 

30 121842 AF027406 Hs.104865 serine/mreonine kinase 23 * 

121847 AA446628 Hs.2799 cartilage linking protein 1 " 

121871 AW972668 Hs.293044 ESTs ~ 

121882 AA426376 Hs.98459 ESTs ™ 

121911 AA427950 gb:zw50f02.s1 ScaiesJotolJetus.Nb2HF8._9w Homo sapiens cDNA clone 

35 121915 AA428179 Hs.223405 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 2.5 

121935 AA428647 Hs.98611 EST fj 

121983 AA298760 Hs.180191 hypothetical protein FLJ14904 *r 

121985 AI862570 Hs.299214 Homo sapiens, clone !MAGE:2822295, mRNA, partial cds ™ 

121995 AA210863 Hs.3532 nemo-like kinase J? 

40 121999 AA430211 Hs.98668 EST . W4M .„- 02 

122009 AW292763 Hs.160822 Homo sapiens cDNA: FU20863 fis, done ADKA01804 ^ 

1»£ wSft 5 HUNM ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 131 

122050 A1453076 Hs.166109 ELAV (embryonic lethal, abnormal vision, Drosophila)4ike 2 9- 1 

45 122060 AA431738 Hs.98750 EST J* 1 

122114 AW161023 Hs.104921 ESTs I, 

122188 AA398838 gb:zt80d01.r1 Soares_testis_NHT Homo sapiens cDNA done 5 , mRNA sequence 3.J 

122204 AA435936 Hs.98842 EST J* 
122246 AA329550 Hs.29417 HCF-binding transcription factor Zhangfei 

50 122257 AA436819 Hs.98899 ESTs ** 

122341 iSSlW Hs.99010 hypoMcat protein FU22263 similar to nuclear receptor-binding SET-domain protein 1 2.0 

122356 AA443794 Hs.98390 ESTs ™ 

122369 AA443985 Hs.303222 ESTs ~ 

55 122371 AA868555 Hs.178222 ESTs ™ 
122372 AA446008 Hs.336677 EST 

122378 AB032948 Hs.21356 hypothetical protein DKF2p762K2015 « 
122405 AA446572 Hs.303223 EST 

122412 AA446869 Hs.1 19316 ESTs \'i 

60 122415 AA446918 Hs.99088 EST J* 

122418 AA446966 Hs.99090 ESTs, Moderately similar to similar to K1AA0766 [H.sapiens] ™ 

122440 AW505139 Hs,9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from done DKFZp547C244) 2.6 

122446 AA447603 Hs.99123 EST Vl 
122448 AA447626 Hs.99127 EST 

65 122458 AI266159 Hs.104980 ESTs ^ , « . , J*? 

122460 AW416788 Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 protein - Caenorhabditis elegans IC.elegans] 9.7 

122464 AA448158 Hs.99152 EST 4 - 8 
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122490 
122492 
122502 
122510 
122530 
122547 
122555 
122570 
122572 
122586 
122587 
122598 
122599 
122602 
122607 
122614 
122616 
122617 
122618 
122622 
122717 
122762 
122829 
122634 
122836 
122837 
122838 
122854 
122856 
122861 
122866 
122868 
122870 
122872 
122879 
122907 
122916 
122981 
123013 
123016 
123034 
123072 
123082 
123088 
123110 
123114 
123131 
123132 
123136 
123149 
123152 
123258 
123315 
123369 
123394 
123433 
123466 
123470 
123471 
123475 
123482 
123486 
123508 
123615 
123619 
123658 
123674 



AA448349 Hs.238151 
AA448417 Hs.104990 
AA204969 Hs.234863 
AA449232 Hs.99195 
AW959741 Hs.40368 
AA779725 Hs.164589 
AA194055 Hs.293858 
AA452578 Hs.262907 
AA452601 Hs.99287 
AK001910 Hs.99303 
AB040893 Hs.6968 
AI028173 Hs.99329 
AL355841 Hs.99330 
AA411925 Hs.301960 
AA453518 Hs.98023 
AA453630 Hs.99339 
AA453638 Hs.161873 
AI681535 Hs.148135 
AA4 53641 

AA453987 Hs.144802 
AA456859 Hs.178358 
AI376875 Hs.105119 
AW204530 Hs.99500 
AA461492 Hs.99545 
AA460581 Hs.290996 
AA461509 Hs.293565 
AA460584 Hs.334386 
AA600235 Hs.9625 
AI929374 Hs.75367 
AA335721 Hs.1 19394 
BE539656 Hs.283705 
AF005216 Hs.1 15541 
AW576312 Hs.318722 
AW081394 Hs.97103 
AA769410 HS.128654 
AA470074 Hs.169896 
AA470140 Hs.229170 
AA478951 Hs.105629 
AW968324 Hs.17384 
AW338067 Hs.323231 
AL359571 Hs.44054 
AI382600 Hs.104308 
AA485360 Hs.105661 
AI343652 Hs.105667 
AA486256 Hs.193510 
BE304942 Hs.265848 
T52027 Hs.271795 
AI061582 Hs.324179 
AW451999 Hs.194024 
AI734179 Hs.105676 
AW601773 Hs.270259 
AA490929 Hs.105274 
AA496369 

AA504757 Hs.105738 
AA731404 Hs.105510 
AW450922 Hs.112478 
AA599042 Hs.112503 
AW303285 Hs.303632 
AB021644 Hs.197219 
BE439553 Hs.250528 
N95059 Hs.55098 
BE019072 Hs.334802 
AW380388 Hs.155546 
AA609170 
AA602964 
AA609364 

A1269609 Hs.105187 



EST 

ESTs 



Homo sapiens cONA FU12082 fe. clone HEMBB1002492 
ESTs 

adaptor-related protein complex 1, sigma 2 subunit 
ESTs 
ESTs 
ESTs 
EST 

Homo sapiens cONA RJ1 1048 lis, done PLACE1004516 
KIAA1460 protein 
ESTs 

hypothetical protein FU23588 
ESTs 
ESTs 
EST 
ESTs 

serine/threonine kinase 33 

gb:zx48e06.s1 Scares testis JJHT Homo sapiens cDNA clone 3', mRNA sequence 
ESTs 
ESTs 
ESTs 
ESTs 

Homo sapiens cDNA RJ10658 lis. done NT2RP2006052 
ESTs 

ESTs, Weakly similar to putative p150 [H.sapiens] 
ESTs 

NIMA (never in mitosis gene a)-related kinase 6 
Src-Bke-adapter 
ESTs 
ESTs 

Janus kinase 2 (a protein tyrosine kinase) 
Homo sapiens cDNA: FU21766 fis, done COLF7179 
ESTs 
ESTs 
ESTs 
EST 
ESTs 
ESTs 

Homo sapiens cONA FU1 1946 fis. done HEMBB1000709 
ninein (GSK3B Interacting protein) 
ESTs. Weakly similar to KIAA1395 protein [H^apiens) 
ESTs 
ESTs 
EST 

myomegaln 

ESTs. Weakly similar to 138022 hypothetical protein [H.sapiens] 
Homo sapiens cDNA FU12371 fis, clone MAMMA1002434 
ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to RMS1 HUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 {Rsapiensl 
gb:zv37d10.s1 Scares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:755827 3' similar to4.1 
ESTs 6.9 
ESTs 3.6 
ESTs 37 
EST 7.4 
Human DNA sequence from clone RP1 1-110H4 on chromosomB 5 Contains a pseudogene similar to 3.5 



6.1 
5.4 

I. 3 

II. 2 
10.1 
2.5 • 

I. 9 
9.5 

II. 0 
3.4 
2.0 
1.7 
4.4 
4.6 
61.5 
10.7 
107.3 
121.4 
31.1 
5.6 
8.5 
10.4 
81.8 
3.6 
4.5 
2.7 
75.3 
7.7 
5.8 

I. 3 
4.1 
5.3 
9.9 
5.3 
13.9 

II. 5 
1.7 
5.0 
15.4 
2.8 
8.7 
8.8 
3.9 
3.8 
7.4 
2.8 
2.4 
15.6 
5.1 
23.8 
5.2 



zinc finger protein 14 (KOX6) 
Homo sapiens, clone 1MAGE:4098694, mRNA, partial cds 
ESTs 

Homo sapiens cDNA FU14680 Qs. done NT2RP2004242, weakly similar to 
KIAA1080 protein; Golgi-assodated, gamma-adaptin ear containing, ARF-binding protein 2 
gb:af12a12.s1 SoaresJestisJiHT Homo sapiens cDNA done 3', mRNA sequence 
gb:no97c02.s1 NCLCGAP Pr2 Homo sapiens cONA done, mRNA sequence 
gb2u71d09.s 1 Soares_testis_NHT Homo sapiens cONA done IMAGE:743441 3' similar to contains AIu. 
kinesin protein 9 gene 5.7 
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123735 NM.013241Hs.9523! FH1/FH2 domain-containing protein 

123738 AA609891 Hs.1 12777 EST *f 

123753 AA609955 Hs.234961 Huntingtin interacting protein E # J u -° 

123804 AA620464 Hs.261915 EST, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 (H' sa Pj®Js] JJ 

12381 1 AA620586 gb:ae60g05.s1 Stratagene lung carcinoma 937218 Homo sapiens cDNA done IMAGE:951320 3 2.7 

, i i ; :_i l 4 in*.. 4 6.2 



123951 AB012922 Hs.173043 metastasis-associated 1 -like 1 
123983 AJ272267 Hs.146178 choline dehydrogenase 
124001 L42542 Hs.75447 ralA binding prote&i 1 
124005 AI147155 Hs.270016 ESTs 
124070 AI950314 Hs.154762 HIV-1 rev binding protein 2 
124074 H05635 Hs.294030 topolsomerase-related function protein 4-2 
124178 BE463721 Hs.97101 putative G protein-coupled receptor 
124203 AA372796 Hs.269339 ESTs, Weakly similar to AF161356 1 HSPC093 [H.sapiens] 
124352 AA640891 Hs.102406 ESTs 
124375 D87454 Hs.192966 KIAA0265 protein 

124385 A1267847 gbiaq49a10jrt Stanley Frontal NB pool 2 Homo sapiens cDNA clone similar to contains 

124390 AA317338 Hs.7535 COBW-tike protein 

124391 AF155099 Hs.279780 NY-REN-1 8 antigen , _ . . 
124417 N34059 gb:yv28h09.s1 Soares fetal liver spleen 1NRS Homo sapiens cONA clone 3' similar to contains Alu 3.3 
124428 H13540 Hs.82202 ribosomal protein L17 ^ .* 2 ' 9 
124440 AA532519 Hs.129043 Human DNA sequence from clone 989H11 on chromosome 22q13.1-13.2. Contains part of a 
124466 R10084 Hs. 11331 9 Wnesin heavy chain member 2 

124482 N53935 gb:yv59d09.s1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA clone 3', mRNA sequence 

124498 H79433 Hs.268997 ESTs 

124515 AA669097 Hs.109370 ESTs , 
124608 N71076 Hs.102800 ESTs, Weakly similar to neuronal thread protein AD7C-MTP [rLsapiens] 
124631 NNL014053HS.270594 FLVCR protein , 
124634 AI765123 Hs.143671 Homo sapiens cDNAFU13533 (is. clone PLACE10Q6371 
124637 AA160474 Hs.75798 hypothetical protein 
124642 AW968856 Hs.278569 sorting nexin 17 
124649 N92593 Hs.313054 ESTs 
124656 AW297702 Hs,102915 ESTs 

124683 AA381661 Hs*11987B ESTs, Weakly similar to M3K9_HUMAN MITOGEN -ACTIVATED PROTEIN KINASE KINASE 
124712 R09166 Hs.191148 ESTs 
124735 R22952 Hs.268685 ESTs 

124761 AA374756 Hs.93560 Homo sapiens mRNA for K1AA1771 protein, partial cds 
124768 AW368528 Hs.100855 ESTs 

124777 R41933 Hs.140237 ESTs, Weakly similar to ALU INHUMAN ALU SUBFAMILY J SEQUENCE 
124788 R43543 Hs.100912 Homo sapiens cDNA: FU22726 fis, clone HS115005 
124809 AL355722 Hs.106875 Homo sapiens EST from done 35214, full insert 

124811 R46068 Hs.288912 hypothetical protein FU22604 

124812 R47948 Hs.188732 ESTs 

124822 AA418160 Hs.86043 Homo sapiens cDNAFUl 3558 fis. clone PLACE1007743 
124825 AA501669 Hs.336693 ESTs 
124833 AW975868 Hs.294100 ESTs 
124857 R63652 Hs.137190 ESTs 
124860 R65763 Hs.101477 EST 
124863 AI382555 Hs.127950 bromodomain-containing 1 
124876 AF135422 Hs.27059 GOP-mannosepyrophosphorylaseA 
124878 BE397530 Hs.288057 hypothetical protein FLJ22242 
124902 H37941 Hs.101883 ESTs 

124930 AI07W43 3 HsJ73939 ESTs, Weakly similar to ALUBJHUMAN till ALU CLASS B WARNING ENTRY ill [H.sapiens] 
124942 R99978 Hs.268892 ESTs, Moderately similar to B34087 hypothetical protein IH.sapiens) 
124958 A1078645 Hs.431 murine leukemia viral (bmi-1) oncogene homotog 

IS T59338 H&2TO463 ESTs, Weakly similar to ALU1JHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
125047 T79815 Hs.279793 ESTs 
125051 T79956 Hs.100588 EST 
125056 T81310 Hs.100592 ESTs 

19M01 AI472068 Hs.286236 KIAA1 856 protein mt . , 

25113 T96595 Hs302270 ESTs, Weakly similar to ALUFJ1UMAN llll ALU CLASS F WARNING ENTRY HI (H.sapiens] 
1251 15 T97341 gb:ye57eQ5.s1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA clone IMAGE:121856 3' similar to 

125125 A1222382 Hs.240767 Human DNA sequence from done RP1-12G14 on chromosome 6q24.1-25.2. Contains the 5* end of the gene 
125147 W38150 Empirically selected from AFFX single probeset 1.7 
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4.4 

7.0 

8.1 

3.7 

1.2 

3.1 

5.7 

3.1 

3.5 

57.1 

2.8 

7.1 



7.8 

2.6 

7.9 

7.8 

3.3 

4.5 

3.2 

5.8 

9.3 

3.5 

6.1 

8.3 

5.6 

7.9 

5.7 

11.3 

9.0 

8.1 

4.9 

2.8 

5.1 

4.2 

14.2 

7.9 

6.6 

2.3 

2.7 

2.3 

23.9 

2.0 

4.4 

2.7 

5.7 

32.4 

22.8 

6.1 

1.9 

4.5 

4.9 

5.0 

135.3 

5.4 

5.6 

1.8 
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125161 W44657 Hs.144232 EST 10.7 

125249 AA630863 Hs.131375 ESTs, Moderately similar to ALUB_HUMAN HI! ALU CLASS B WARNING ENTRY !I! [H^apiens] 1.3 

125255 AF098162 Hs.1 18631 timeless (Drosophila) homolog 9.4 

125279 AW401809 Hs.4779 WAA1150 protein 1.5 

5 125280 AI123705 Hs.106932 ESTs 8.0 

125298 AW972542 Hs.289008 Homo sapiens cDNA: FU21814 fis, done HEP01068 1.5 

125660 AW292171 Hs.23978 scaffold attachment factor B 5.9 

125827 NMJ»3403Hs.97496 YY1 transcription factor 1.2 

125891 U29589 Hs.7138 cholinergic receptor, rraiscannic 3 6.4 

10 126005 AW409701 Hs.1578 baculoviral lAP repeat-containing 5 (survivin) 14.3 

126202 AA1 57632 Hs.272630 vacuolar proton pump delta polypeptide 2.4 

126695 AA643322 Hs.1 72028 a disintegrin and metaiioproteinase domain 10 9.1 

127050 AW411066 Hs.274351 CGI-89 protein 17.0 

127274 AW966158 Hs.58582 Homo sapiens cONA RJ 12789 fis, clone NT2RP200 1947 12.8 

15 128355 AW293012 Hs.161623 ESTs 7.3 

128493 D87466 Hs.240112 K1AA0276 protein 3.1 

128493 D87466 Hs.240112 KIAA0276 protein L3 

128522 BE173977 Hs.10098 putative nucleolar RNA heficase 9.4 

128527 AA504583 Hs.101047 transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47) 1.5 

20 128528 R39234 Hs.251699 ESTs, Weakly similar to IDN4-GGTR14 [H.sapiens] 2.8 

128595 U31875 Hs.272499 short-chain alcohol dehydrogenase family member 12.1 

128599 NM_015366Hs.102336 Rho GTPase activating protein 8 2.3 

128604 AI879099 Hs.102397 GIOT-3 for gonadotropin irKJudble trarscription repressor-3 • 1.3 

128608 BE267994 Hs.102419 zinc finger protein 7.1 

25 128625 AB037841 Hs.1 02652 hypothetical protein ASH 1 1-3 

128629 AL096748 Hs.102708 DKFZP434A043 protein 3.2 

128639 AW582962 Hs.102897 CGW7 protein 2.0 

128656 AA458542 Hs.10326 coatomer protein complex, subunitepsilon M 

128656 AA458542 Hs.10326 coatomer protein complex, subunitepsilo 1.3 

30 128658 BE397354 Hs.324830 diptheria toxin resistance protein required for diphthamide biosynthesis {Saccharomyces)-iite 2 2.4 

128670 AA975486 Hs.103441 Homo sapiens, Similar to RIKENcONA 170001 0L1 9 gene, done MGC:16214, mRNA, complete cds 7.1 

128691 W27939 Hs.103834 hypothetical protein MGC5576 7.7 

128698 BE081143 Hs.225977 nudear receptor coactivator 3 3.8 

128700 Y15221 Hs.103982 small Indudble cytokine subfamily B (Cys-X-Cys), member 1 1 1.6 

35 128714 T85231 Hs.179661 tubulin, beta 5 7.6 

128717 AK001564 Hs.104222 hypothetical protein FU10702 5.5 

128733 BE147740 Hs.104558 ESTs, Moderately similar to I38022 hypothetical protein [H.sapiensJ 2.7 

128737 AF292100 Hs.104613 RP42 homolog 2.8 

128742 AA307211 Hs.251531 proteasome (prosome, macropain) subunit, alpha type, 4 4.4 

40 128746 AI470163 Hs.323342 actin related protein 2/3 complex, subunit 4(20 kD) 2.2 

128747 AB027249 Hs.104741 PDZ-binding kinase; T-cell originated protein kinase 2.8 

128772 BE302796 Hs.105097 thymidine kinase 1, soluble 5.3 

128781 N71826 Hs.105465 small nuclear ribonudeoprotein polypeptide F 53.9 

128797 NM 002975Hs. 105927 stem cell growth factor; lymphocyte secreted C-type lectin 13.3 

45 128806 AW630942 Hs.106061 RD RNA-blnding protein 2.6 

128814 AW248431 Hs.256526 nudear prelamin A recognition fador 2.2 

128830 BE281170 Hs.106357 valosin-containing protein . - - 5.9 

128835 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (from done DKFZp586H0924) 1.6 

128854 BE159181 Hs.168232 hypothetical protein FU 13855 2.2 

50 128854 BE159181 Hs.168232 hypothetical protein FLJ13855 1.9 

128668 AA4 19008 Hs. 106730 chromosome 22 open reading frame 3 3.0 

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 22 

128871 AF189723 Hs.106778 ATPase, Ca++ transporting, type 2C, member 1 1.5 

128891 F34856 Hs.292457 Homo sapiens, done MG&16362. mRNA, complete cds 13.3 

55 128906 R57988 Hs.10706 epithelial protein tost In neoplasm beta 4.7 

128920 AA622037 Ks.166468 programmed ceil death 5 1.4 

128925 R67419 Hs.21851 Homo sapiens cDNA FU12900 fis, done NT2RP2004321 1.9 

128946 Y13153 Hs.107318 kynurenine 3-monooxygeiiase (kynurenlne 3-nydroxyfese) 7.2 

128949 AA009647 Hs,8850 a disintegrin and metaiioproteinase domain 12 (meJtrina!pha)(ADAM-12) 2.4 

60 128958 AW139032 Hs.107376 hypothetical protein DKFZp434N035 1.3 

128959 AI580127 Hs. 1 07381 hypothetical protein FU 11 200 10.9 

128965 AW150697 Hs.107418 ESTs 1.4 

128970 AI375672 Hs.165028 ESTs 1.3 

128975 BE560779 Hs.284233 NJCE-5 protein 14.0 

65 128979 AW271217 Hs.281434 Homo sapiens cONA fUUQ28 6s, done HEMBA1003838 1.6 

128995 A1816224 Hs, 1 07747 DKFZP566C243 protein 1.9 

129019 A1950087 gbwq05c02jc1 NCLCGAPJ<id12 Homo sapiens cDNA clone 3'. mRNA sequence 2.9 
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129021 AL044675 Hs.173081 KIAA0530 protein 2 5 

129021 AL044675 Hs.173081 KIAA0530 protein 3 ' 4 

129032 R80088 Hs.108104 ubiquitin-conjugaling enzyme E2L 3 . _ 

129076 aS806 Hs.326234 ESTs. Highly Uto T46422 hypothetical protein DKFZp434M2023.1 [Ksapiens] 5.0 

5 129078 AI351010 Hs.102267 lysosomal « 1 

129088 AA744610 Hs.194431 paBadin 2 j 

129095 L12350 Hs.108623 thrombospondin 2 2Qg 

129096 AA463189 Hs.288906 WW Domain-Containing Gene ' 

129097 BE243933 Hs.1Q8642 zinc finger protein 22 {KOX 15) tt ^ mnm . . 58 
10 129099 AF146074 Hs. 1 08660 ATP^indingcassete,suWamiryC(CrnR/MRP). member 5 ™ 

129136 W93048 Hs.250723 hypothetical protein MGC2747 * 

129149 AA356620 Hs.108947 KIAA0050 gene product ' 

129172 AW162916 Hs.241576 hypothetical protein PR02577 21 

129192 AA286914 Hs.183299 ESTs 3 2 

15 129194 AA150797 Hs.109276 latexin protein 58 

129198 N57532 Hs.1 0931 5 KIAA14 15 protein * 

129207 AI934365 Hs. 1 09439 osteoglycin (osteoinductive factor, mimecan) • 

129228 U40714 Hs.239307 tyrosyi-tRNA synthetase ' 2 

129229 AF013758 Hs.109643 polyadenylate binding protein-interacting protein 1 ■ 
20 129254 AA252468 Hs.1098 OKFZp434J 1813 protein " 

129255 A1961727 Hs. 1 09804 H1 hlstone family, member X qfi 
129288 W26392 Hs.110080 ESTs, Weakly similarto S13495 pregnancy zone protein [H.sapiensj 

129296 AI051967 Hs.1 10122 ESTs ,^ 44C _^ Q , arcinirvim 51 
129323 AA287239 Hs.5518 Homo sapiens cDNA FLJ11311 fis, clone PLACE 101 01 02 

25 129340 H75334 Hs.11050 F-box only protein 9 w.T^nh^ 7*6 

129347 BE614192 Hs.279869 melanomanassodated antigen recognised by cytotoxic T lymphocytes '* 

129362 U30246 Hs.1 10736 solute carrier family 12 {sodium/potassium/chlonde transporters), member 2 o./ 

129366 BE220806 Hs.184697 Homo sapiens done 23785 mRNA sequence ^ 

129370 AI686379 Hs.1 10796 SAR1 protein 2 ' 0 

30 129372 NMJ)16039Hs.1 10803 CG 1-99 protein 74 

129403 AF149785 Hs.111126 pituitary tumor-transforming 1 interacting protein • 

129404 AI267700 Hs.317584 ESTs 2 ' 5 
129404 AI267700 Hs.317584 ESTs { Q 2 
129423 AA204686 Hs.234149 hypothetical protein FU20647 8Q 

35 129449 A1096988 Hs.111554 ADP-fibosylabon factor-like 7 3 ' 2 

129453 AW974265 Hs.111632 Lsm3 protein 6 7 

129482 AA188185 Hs.289043 spmdlin 36 

129482 AA188185 Hs.289043 spindfin -,\ 

129513 AW843633 Hs.306163 hypothetical protein AL1 1011 5 ' 

40 129515 AF255303 Hs.112227 membrane-associated nudefc aad binding protein ^ 

129527 AA769221 Hs.270847 delta-tubuiin 7 * 5 

129559 W01296 Hs.1 1360 hypothetical protein FU14784 ■ 

129560 AA317841 Hs.7845 . hypothetical protein MGC2752 ^ Q 
129570 AI923097 Hs.11441 chromosome 1 open reading frame 8 6 

45 129575 F08282 Hs.278428 progestin Induced protein " 

129587 H14718 Hs.1 1506 Human done 23589 mRNA sequence * 

129588 BE408300 Hs.301862 postmeiotlc segregation increased 2-like 9 ■ 
129591 N57423 Hs.179898 HSPC055 protein g Q 
129594 AW403724 Hs.36989 coagulation factor Vil (serum prothrombin conversion accelerator) 

50 129596 AF035537 Hs.115521 REV3 (yeast homo!og)-fite, catalytic subunit of ^^J^^J^J^ 
129628 U38945 Hs.1174 cyclln-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4) 

129628 U38945 Hs.1174 cyciin-dependent kinase Inhibitor 2A (me 

129629 AK000398 Hs.11747 hypothetical protein FU20391 3 3 
129649 AD000092 Hs.16488 calreticulin 134 

55 129675 NMJ)15556Hs.172180 KIAA0440 protein u \ 

129680 U03749 gb:Human chromogranln A (CHGA) gene, promoter an «j« 

129689 AW748482 Hs.77873 B7homolog3 , , , n. , , 74 
129702 AI304966 Hs.12035 ESTs, Weakly similarto I38022 hypothetical protein [Ksapiens] 

129720 AA156214 Hs.12152 APMCF1 protein * 

60 129721 NM.001415HS.211539 eukaryotic translation initiation factor 2, subunit 3 (gamma. 52kD) w 

129726 H15474 Hs.132898 fatty acid desaturase 1 A 

129778 AK001676 Hs.1 2457 hypothetical protein FU 1081 4 ' 

129779 AA394090 Hs.12460 Homo sapiens done 23870 mRNA sequence ™ 
129800 AF052112 Hs.12540 lysosomal '« 

65 129806 AB023148 Hs.173373 K1AA0931 protein ' 1 

129815 BE565817 Hs.26498 hypothetical protein FU21657 *' 
129840 NfvL006590Hs.12820 SnRNP assembly defective 1 homolog 
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129922 
129945 
129953 
129972 
129983 
129989 
130010 
130081 
130082 
130097 
130100 
130111 
130112 
130112 
130128 
130135 
130211 
130212 
130236 
130241 
130242 
130249 
130263 
130287 
130310 
130353 
130356 
130357 
130359 
130367 
130372 
130393 
130399 
130407 
130409 
130419 
130441 
130448 
130455 
130455 
130471 
130485 
130487 
130498 
130503 
130511 
130511 
130526 
130542 
13D544 
130553 
130556 
130567 
130568 
130574 
130586 
130598 
130601 
130614 
130617 
130617 
130618 
130667 
130674 



AL049999 Hs.85963 

AI393237 Hs.129914 

AI222069 Hs.13015 

AF042379 Hs.13386 

BE514376 Hs.165998 

AA412195 Hs.13740 
AW753185 Hs.180628 

U09848 Hs,132390 

AB015856 Hs.247433 

AA301116 Hs.142838 

AA287325 Hs.14713 

S73265 Hs.1473 

AL046962 Hs.14845 

AL135561 HS.14891 

X53002 Hs.149846 

AA916785 Hs.180610 

AA916785 Hs.180610 

L76937 Hs,150477 

AA311426 Hs.21635 
NM 003358HS.23703 

D80001 Hs.152629 

R85367 Hs.51957 

AL035588 Hs.153203 



X79201 
D81983 



Hs.153221 
Hs.322852 



NMjK)2497Hs.153704 
AA479005 Hs.154036 
AB011121 Hs.154248 
Z19084 Hs.172210 
AF127577 Hs.155017 
AJ224442 Hs.15502Q 
NM 013449HsJ277401 
AL135301 Hs.8768 
AI077464 Hs.5011 
N89487 Hs.155291 
AW374106 Hs.155356 
BE385099 Hs.334727 
NM.001197HS.155419 
AF037448 Hs.155489 
U63630 Hs.155637 
BE513202 Hs.15589 
D90041 Hs.155956 
D90041 Hs.155956 
AL121438 Hs.183706 
BE245851 Hs.180779 
U49844 Hs.77613 
L38951 Hs.180446 
BE208491 Hs.295112 



U2137 
L32137 



Hs.1584 
Hs.1584 



AW876523 Hs.15929 
U64675 Hs.179825 
AA321238 Hs.4310 
AF062649 Hs.252587 
AI907018 Hs.15977 
AA383092 Hs.1608 
AA232119 Hs.16085 
AF083208 Hs.16178 
AB007891 Hs.16349 
AL042210 Hs.16493 
AA609738 Hs.16525 
A1354355 Hs.16697 
M90516 Hs.1674 
M90516 Hs.1674 
AA383439 Hs.16758 
BE246961 Hs.17639 
AL048842 Hs.194019 



DKFZP564M182 protein 2.2 

runt-related transcription factor 1 (acute myeloid leukemia 1 ; am!1 oncogene) 1.7 

hypothetical protein similar to mouse Dnajll 2.7 

gamma-tubulin complex protein 2 4.5 

PAM mRNA-bindlng protein 1.8 

ESTs 2.5 

dynamin 1-like 1.8 

zinc finger protein 36 (KOX 18) 1.3 

activating transcription factor 6 4.0 

nucleolar phosphoprotein Nopp34 1 .6 

ESTs 4.0 

gastriTMeieasing peptide 1.8 

forkheadbox03A 2.8 

hypothetical protein FU21047 2.3 

integrin, beta 5 2.3 

splicing factor proline/glutamlne rich (polypyrimldine tract-binding protein-associated) 3.0 

splicing factor proline/glutamine rich ( 2.1 

Werner syndrome 1.8 

tubulin, gamma 1 6.1 
ESTs, Moderately similar to CEGT HUMAN CERAMIDE GLUCOSYLTRANSFERASE [H.sapiens] 1.6 

KIAA0179 protein 1.3 

splicing factor, arginine/serine-rich 2, foteracting protein 2.0 

MyoD family inhibitor 3.2 

synovia] sarcoma, translocated to X chromosome 5.4 

GAS2-related on chromosome 22 4.8 

N IMA (never in mitosis gene a)-related kinase 2 1 .4 

tumor suppressing subtransferable candidate 3 2.6 

amyotrophic lateral sclerosis 2 (Juvenile) chromosome region, candidate 3 6.3 

MUF1 protein 6.2 

nuclear receptor Interacting protein 1 2.4 

putative methyrtransferase 3.4 

bromodomain adjacent to zinc finger domain, 2A 8.5 

hypothetical protein FU 10849 1 .4 

RNA binding motif protein 9 3.3 

KIAA0005 gene product 1.6 

hypothetical protein MGC2840 similar to a putative glucosyitransferase 3.4 

hypothetical protein MGC3017 2.3 

BCL2-interacting killer (apoptosis-inducing) 2.7 

NS1 -associated protein 1 1 .8 

protein kinase, DNA-activated, catalytic polypeptide 2.3 

PPAR binding protein 3.9 

N-acetyttransferase 1 (arytamine N-acetyltransferase) 33.6 

N-acetyltransferase 1 (arylamine N-acety 4.6 

adducin 1 (alpha) 2.7 

H2B histone family, member B 5.0 

ataxia telangiectasia and Rad3 related 4.3 

karyopherin (importin) beta 1 - 1.6 

K1AA0618 gene product 16.1 

cartilage oligomeric matrix protein (pseudoachondroplasla, epiphyseal dysplasia 1 , multiple) 6.1 

cartilage oligomeric matrix protein (pse 5.3 

hypoftetlcal protein OJ12910 2.1 

RAN binding protein 2-like 1 7.8 

eukaryotic translation Vitiation factor 1A 1 .5 

pituitary tumor-transforming 1 14.4 

Empirically selected from AFFX single probeset 4.7 

replication protein A3 (14kO) 7.9 

putative G-protein coupled receptor 3.3 

apoptosis antagonizing transcription factor 1.2 

KIAA0431 protein 5.6 

hypothetical protein DKFZp762N2316; WAA1803 protein 1.4 

ESTs 1.5 

down-regulator of transcription 1 , TBP-binding (negative cefaclor 2) 1 .3 

g lirtamJne-mjctose-6-phosphate transaminase 1 12.1 

glutamine-fructose-6-phosphate transamin 2.4 

Splr-1 protein 155 

Homo sapiens ubtquitin protein figase (UBE3B) mRNA, partial cds 135 

attractin 1.5 
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5.4 
5.0 
2.0 



1.5 
3.4 
1.7 



130675 AA442233 Hs.17731 hypothetical protein FL J 12892 
130692 AA652501 Hs.13561 hypothetical protein MGC4692 

130593 R68537 Hs.17962 ESTs , g 

130712 AJ271881 Hs.279762 brornodomain-containing 7 ' 

5 130714 A1348274 Hs.18212 DNA segment on chromosome X (unique) 9879 expressed sequence J" 

130730 AB007920 Hs.18586 KIAA0451 gene product ?" 
130744 H59696 Hs.18747 POP7 (processing of precursor, S. cerevislae) homolog 

130751 AF052105 Hs.18879 chromosome 12 open reading frame ™ 

130757 AL036067 Hs.18925 protein x0001 ^ At ^ ^ 4 5 

10 130768 AF258627 Hs.211562 ATP^inding cassette, sub-family A (ABC1), member 1 

130789 AK000355 Hs.8899 sirtuin (silent mating type information regulation 2, Sxerevisiae, homolog) 5 m 

130815 AB018298 Hs.19822 SEC24 (S. cerevisiae) related gene family, member D ];? 

130836 J05068 Hs.2012 transcobalamin I (vitamin B12 binding protein, R binder family) ™ 

130841 AL157468 Hs.325825 Homo sapiens cDNA FU20848 fis, clone ADKA01732 £ J 

1 5 130843 AA447492 Hs.2Q183 ESTs, Weakly similar to AF164793 1 protein x 013 [H.sapiensl 
130844 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 
130855 AJ243706 Hs.143323 putative DNA/chromatln binding motif 

130861 NM_016578Hs.2O509 HBV pX associated protein-8 ' 

130879 NNL003416Hs.2076 zinc finger protein 7 (KOX 4, clone HF. 16) ]f 

20 130880 BE514434 Hs.20830 WnesirHike 2 2 * 4 

130892 AL120837 Hs.20993 high-glucose-regulated protein 8 * 

130898 AB033078 Hs.186613 sphingosine-1-pnosphate lyase 1 * 

130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamily A, member 2 - 

130919 N79110 Hs.21276 collagen, type IV, alpha 3 (Goodpasture antigen) binding protein " 

25 130944 BE382657 Hs.21486 signal transducer and activator of transcription 1,91 kD 

130971 N39842 Hs.301444 KIAA1673 7o 

130992 BE398091 Hs.74316 desmoplakin (DPI, DPI1) ]Z 

130993 T97401 Hs.21929 ESTs 

131005 AV658308 Hs.2210 thyroid hormone receptor interactor 3 * 

30 131028 AI879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP), gamma • 

131042 AI826288 Hs.171637 hypothetical protein MGC2628 * 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cys-X-Cys). member 10 ' •£ 

131046 AA321549 Ks.2248 small inducible cytokine subfamily B (Cy ■ 

131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-linted retinopathy protein [Ksaplens] l./ 
35 131060 AA194422 Hs.22564 myosin VI 2 ' 5 

131060 AA194422 Hs.22564 myosin VI ? i 

131076 /W49230 Hs^6433 doltehytyhosphate (UDP-N-acetytgtomine) N-a(^lu<»samir^phcsprK>transferase 1 ( 2.0 

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosam J* 

40 131099 AL133353 Hs.226581 COX15 (yeast) homolog, cytochrome c oxidase assembly proton 

131174 NM_006540Hs.29131 nuclear receptor coactivator 2 * 

131185 BE280074 Hs.23960 cyclinBI 20 

131206 AW138839 Hs.24210 ESTs 7 0 

131213 AA885699 Hs.24332 CG1-26 protein 7 ' 5 

45 131225 H62087 Hs.31659 thyroid hormone receptor^associated protein, 95-kD subunlt 

131231 N47468 Hs.59757 zinc finger protein 281 * 

131233 D89053 Hs.268012 fatiy-acid-Coenzyme A ligase, long-chain 3 * 

131243 AW383256 Hs.24752 spectrin SH3 domain binding protein 1 ' 

131245 AL080080 Hs.24766 thioredoxin domain-containing ' 

50 131247 AU043100 Hs.326190 fatty add amide hydrolase ' 
131281 AA251716 Ks.25227 ESTs 

131283 X80038 Hs.339713 Homo sapiens clone F19374 APO E-C2 gene cluster 

131305 AV656017 Hs.184325 CGI-76 protein ™ 

131320 AA505691 Hs.145696 spficing factor (CC1 .3) * 

55 131339 AF05869S Hs.25812 Nijmegen breakage syndrome 1 (nibnn) ■ 

131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nib/In) * 

131375 AW293165 Hs.143134 ESTs ™ 
131390 BE269388 Hs.182698 mitochondria! ribosomat protein L20 



131410 BE259110 Hs.279836 HSPC166 protein " 

60 131412 NWL012247HS.124027 SELENOPHOSPHATE SYNTHETASE ; Human selenium donor protein 2.0 

131429 AL046302 Hs.26750 hypothetical protein FLJ21 908 J"! 
131458 BE297567 Hs.27047 hypothetical protein FLJ20392 
131475 AA992841 Hs.27263 KIAA1458 protein 

131501 AV661958 Hs.8207 GK001 protein rt 

65 131501 AV661958 Hs.8207 GK001 protein mmn * in 

131511 AA732153 Hs.27865 Homo sapiens cONA: FU21333 fis, clone COL02535 2.0 
131528 AU076408 Hs.28309 UDP-glucose dehydrogenase 

131 



WO 02/059377 PCT/US02/02242 



131532 BE268278 Hs.28393 hypothetical protein MGC2592 7.4 

131543 AW966881 Hs.41639 programmed cefl death 2 2.2 

131544 AL355715 Hs.28555 programmed cell death 9 (PDCD9) 2.1 
131562 NNL003512HS.28777 H2AhistDnefamQy,membefL 1.7 

5 131564 T93500 Hs.28792 Homo sapiens cONA FL1 1 1 04 1 fe, clone PUCE 1004405 5.1 

131564 T93500 Hs.28792 Homo sapiens cDNAFLh 1041 fis, clone PL 1.8 

131569 AL389951 Hs.271623 nudeoporin50kD 5.0 

131618 BE393822 Hs.29645 Homo sapiens mRNA;cDNADKFZp761CQ29 (from clone OKFZp761C029); partial cds 1.8 

131622 R78195 Hs.29692 Homo sapiens cDNA FU1 1436 fis, clone HEMBA1001213 1.3 
10 131623 AB037791 Hs.29716 hypothetical protein FU10980 2.2 

131623 AB037791 Hs.29716 hypofcefical protein FU1098O 1.9 
131643 AW410601 Hs.30026 HSPC 182 protein 2.9 
131653 AW960597 Hs.30164 ESTs U 
131656 AI21891B Hs.30209 KIAA0854 protein 2.8 

15 131669 X52486 Hs.3041 uracil-DNA glycosylase 2 2.8 

131692 BE559681 Hs.30736 K1AA01 24 protein 5.6 

131714 AA642831 Hs.31016 putative DNA binding protein 2.9 

131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransferase 3.4 

131737 AK001641 Hs.31323 inhibitor of kappa fight polypeptide gene enhancer inB-cells, kinase complex-associated protein 3.8 

20 131760 X76732 Hs.3164 nucleobindin 2 2.9 

131760 X76732 Hs.3164 nucleobindin 2 2.8 

131763 AI876932 Hs.317 topoisomerase (DNA) I 3.4 

131772 AA382590 Hs.170980 WAA0948 protein 25.5 

131774 BE267158 Hs.169474 DKFZP586J011 9 protein 5.5 

25 131787 087077 Hs.1S6275 KIAA0240 protein 2.4 

131793 AW966127 Hs.32246 Homo sapiens cONA FU 14656 fis, done NT2RP2002439 7.9 

131795 BE501849 Hs.32317 high-mobility group 20B 1.4 

131798 X86098 Hs.301449 adenovirus 5 E1 A binding protein 4.1 

131817 U20536 Hs.3280 caspase 6. apoptosis-related cysteine protease 4.2 

30 131824 U28838 Hs.32935 TATA box binding protein (TBP)-associated factor, RNA polymerase 111, GTF3B subunit 2 3.5 

131850 A1251317 Hs.33184 ESTs 5.1 

131878 AA083764 Hs.6101 hypothetical protein MGC3 178 5.8 

131885 BE502341 Hs.3402 ESTs 13.7 

131885 BE502341 Hs.3402 ESTs 2.4 

35 131887 W17064 Hs.332848 SW1/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, member 1 3.2 

131900 AA099014 Hs.231029 Homo sapiens, done MGC:15961, niRNA, complete cds 8.7 

131900 AA099014 Hs.231029 HOTOsapiens,(toneMGC.i5961,n)RNA ( com 2.0 

131904 AF078866 Hs.284296 Homo sapiens cDNA: FU22993 fa, done KAT1 1914 5.5 

131905 AA179298 Hs.3439 stomatln-like 2 11.3 
40 131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Drosophife; Gpid desaturase) 1.7 

131916 AA025976 Hs.34569 ESTs 5.2 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (yeast APC 11 homolog) 2.7 

131929 BE541211 Hs.34804 Homo sapiens cDNAFU 11472 fis, done HEMBA1001711 5.3 

131941 BE252983 Hs.35086 ubiquitin specific protease 1 2.3 

45 131950 AA3551 13 Hs.35380 x 001 protein 1.5 

131962 AK000046 Hs.267448 hypothetical protein FU20039 2.3 

131965 W79283 Hs.35962 ESTs - 1.4 

131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5 

131977 U90441 Hs.3622 procollagen-profine, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide II 6.5 

50 131985 AA503020 Hs.36563 hypothetical protein FU224 18 2.4 

131991 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 (yeast homolog), beta 2.1 

132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, partial cds 3.2 

132031 AF193844 Hs.3758 COP9 complex subunit 7a 5.8 

132062 BE266155 Hs.3832 cbthnn-assoclated protein AP47 1.5 

55 132084 NM„002267Hs.3886 kan/opherinalpha3(importinalpha4) 3.7 

132103 BE171921 Hs.3991 ESTs 1.4 

132105 AV646076 Hs.39959 ESTs 5.8 

132116 AW960474 Hs.40289 ESTs 1.7 

132176 AA857025 Hs.8878 Wnesin-like 1 3.3 

60 132180 NM_004460Hs.418 fibroblast activation protein, alpha 14.7 

132192 AA206153 Hs.4209 mitochondrial ribosomal protein L37 5.5 

132194 R42432 Hs.4212 ESTs 4.4 

132203 N&L004782HS.194714 synaptosomal^ssociated protein, 29kD 2.2 

132207 BE206939 Hs.42287 E2F transcription factor 6 2.2 

65 132235 AV658411 Hs.42656 KIAA1681 protein 7.8 

132240 AB018324 Hs.42676 K1AA0781 protein 1.5 

132252 AI566004 Hs.141269 Homo sapiens cONA: FU21550 fis, done COL06258 1.3 
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132266 M301228 Hs.43299 hypothetical protein FLJ 12890 J.7 

132273 AA227710 Hs.43658 OKFZP586L151 protein ¥ 

132276 AA653507 Hs.285711 hypothetical protein FU1 3089 J*J 

132288 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), mem J£ 

5 132294 AB023191 Hs.44131 KIAA0974 protein I 0 - 0 

132298 NM.015986Hs.7120 cytokine receptor-like molecule 9 J» 

132299 AW405882 Hs«442Q5 corBstalin Jf 
132325 N37Q65 Hs.44856 hypothetical protein FU121 16 tr 
132348 AW067708 Hs.170311 heterogeneous nudearribonudeopr^^ J* 

10 132370 AW572805 Hs.46645 ESTs lf _ efe „ efl iS 

132374 AF155582 Hs.46744 corel UDP^alactose:N-acstylga!adra ™ 

132376 AI279892 Hs.46801 sorting nexin 14 ™ 
132384 AA312135 Hs.46967 HSPC034 protein 

132393 AL135094 Hs.47334 hypothetical protein FLJ 14495 J-J 

15 132450 AA100O12 Hs.48827 hypothetical protein FU 12085 J* 

132452 AW973521 Hs.247324 mitochondrial ribosomal protein S1 4 ™ 

132456 AB011084 Hs.48924 KIAA0512 gene product; ALEX2 J-J 

132465 AW169847 Hs.49169 KIAA1 634 protein J* 
132470 AI224456 Hs.4934 H.saplens polyA site DNA 

20 132484 X16660 Hs. 119007 RAB4, member RAS oncogene family ™ 
132518 AW885606 Hs.5064 ESTs 

132528 T78736 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 

132530 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. cerevisiae)-Uke 1 2.0 

132532 AA454132 Hs.5080 mitochondrial ribosomal protein L1 6 *•* 

25 132534 BE388673 Hs.5086 hypothetical protein MGC10433 *•* 
132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 

132571 AW674699 Hs.5169 suppressor of G2 allele of SKP1. S. cerevisiae, homolog of J- 

132574 AW631437 Hs.5184 TH1 drosophSa homolog f l 

132596 AK001484 Hs.5298 CGl-45 protein ~ 
30 132611 AA345547 Hs.53263 hypothetical protein FU 13287 . J* 

132612 H12751 Hs.5327 PR01914 protein J* 

132616 BE262677 Hs.283558 hypothetical protein PR01 855 ]™ 
132638 AI795870 Hs.54277 DMA segment on chromosome X (unique) 9928 expressed sequence 

132648 U51127 Hs.54434 hypothetical protein MGC 171 5 J-J 

35 132668 AB018319 Hs.5460 KIAA0776 protein J? 
132692 AW191962 Hs.249239 collagen, type VIII, alpha 2 

132715 F11875 Hs.5534 Homo sapiens cDNA FU12961 lis, clone NT2RP2005645 J-J 

132718 NMJ»4600Hs.554 Sjogren syndrome antigen A2 (60kD, ribonucleoprotein autoantigen SSWRo) J.u 
132724 A1142265 Hs.55498 geranylgeranyl diphosphate synthase 1 J* 
40 132731 AI189075 Hs.301872 hypothetical protein MGC4840 J J* 

132744 AA010233 Hs.55921 glutamyl-proIyl-tRNA synthetase ™> 
132760 AA125985 Hs.56145 thymosin, beta, identified in neuroblastoma cells 

132771 Y10275 Hs.56407 phosphoserine phosphatase *" 
132773 AA459713 Hs.295901 KIAA0493 protein fj 
45 132784 AI142133 Hs.56845 GDP dissociation inhibitor 2 
132798 A1026701 Hs.5716 KIAA0310 gene product 

132807 U07418 Hs.57301 mutL (E. colt} homolog 1 (colon cancer, nonpolyposis type 2) 

132810 AB007944 Hs.5737 KIAA0475 gene product momhor o a 7 

132813 BE313625 Hs.57435 solute carrier family 11 (proton-coupled divalent metal ion transporters), member 2 w 

50 132815 AI815189 Hs.57475 sex comb on midleg homolog 1 

132817 N27852 Hs.57553 tousled-like kinase 2 *J 
132821 AJ251595 Ks.169610 CD44 antigen (homing function and Indian blood group system) f? 
132833 U78525 Hs.57783 eukaiyoiic translation Initiation factor 3, subunit 9 (eta, 116kD) J™ 
132842 NM_016154Hs.279771 Homo sapiens clone PP1 596 unknown mRNA J* 

55 132844 F12200 Hs.5811 chromosome 21 open reading frame 59 fj 
132851 U09716 Hs.28791 2 lectin, mannose-binding, 1 ™ 
132863 BE268048 Hs.236494 RAB 10, member RAS oncogene family «•< 
132869 AW963217 Hs.203961 ESTs, Moderately similar to AF1 16721 89 PR021 68 [H. sapiens] 2.8 
132873 AW007683 Hs.58598 KIAA1266 protein *■» 

60 132875 NWL004850Hs.58617 Rho^ssodated, coiled-coil containing protein kinase 2 J-J 
132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary factor 1 (non-standard symbol) 1.4 
132897 AW503667 Hs.59545 ring finger protein 15 " 
132902 AI936442 Hs.59838 hypoftietical protein FU10808 

132912 AW732760 Hs.1G7578 Homo sapiens cDNA FU11095 fis, clone PLACE1005374 7.1 
65 132913 W78714 Hs.60257 Homo sapiens cONA FU13598 fis, clone PLACE1009921 2.8 

132940 779136 Hs.127243 Homo sapiens mRNA for KIAA1724 protein, partial cds 6.1 

132941 AI817165 Hs.6120 hypothetical protein FU13222 ™- 3 
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132942 AA554458 Hs.197751 K1AA0666 protein 1.8 

132952 AI658580 Hs.61426 Homo sapiens mesenchymal stem cell proteh OSC96 mRNA, partial cds 12 

132962 AA576535 Hs.6153 CGI-48 protein 4.9 

132972 AA034365 Hs.288924 Homo sapiens cONA FLJ11392 fiS # clone HEMBA1000575 2.7 

5 132973 AA035446 Hs.323277 ESTs 5.3 

132977 AA093322 Hs.301404 RNA binding motif protein 3 3.2 

132980 AA040696 Hs.62016 ESTs 1.3 

132994 M1 12748 Hs.279905 clone HQ0310 PRO0310p1 3.0 . 

133012 AA847843 Hs.62711 Homo sapiens, clone IMA6E:3351295. mRNA 10.3 

10 133015 AJ002744 Hs.246315 UDP-N^cetyl-alpha^aiacic^mine^ 2.1 

133016 AI439S88 Hs.6289 hypothetical protein FU20886 1.3 

133053 AI065016 Hs.6390 Homo sapiens done FLB3344 PRO0845 mRNA, complete cds 6.0 

133062 AW500374 Hs.64056 PRO01 49 protein 5.3 

133069 BE247441 Hs.6430 protein with polyglutamlne repeat calcium (ca2+) homeostasis endoplasmic reticulum protein 4.9 

15 133091 AK001628 Hs.64691 KIAA0483 protein 3.5 

133110 AA808177 Hs.65228 ESTs 13.1 

133134 AF198620 Hs.6564B RNA binding motif protein 8A 1.3 

133145 H94227 Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA, partial cds . U 

133152 Z11695 Hs.324473 mitogen^activated protein kinase 1 1.3 

20 133174 AA431620 Hs,324178 hypothetical protein MGC2745 17.1 

133175 AW955632 Hs.66666 ESTs, Weakly similar to S19560 profine-rich protein MP4 - mouse [M.muscu!us] 1.8 

133177 X97795 Hs.66718 RAD54 (S.cerevislae)-like 4.9 

133197 A1275243 Hs.180201 hypothetical protein FU20671 3.1 

133208 AI801777 Hs.$774 ESTs 4.4 

25 133226 AW954569 Hs.296287 Homo sapiens, Similar to bromodonraiivcontaining 4, done IMAGE:3542455, mRNA, partial cds 1.7 

133228 AI492924 Hs.6831 golgi phosphoprotein 1 6.0 

133240 AK001489 Hs.242894 ADP-ribosylation fador-like 1 1.5 

133254 AI567421 Hs.273330 Homo sapiens, done IMAGE:3544662, mRNA, partial cds 1.4 

133266 AI160873 Hs.69233 zinc finger protein 5.6 

30 133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD£_HUMAN FORKHEAD BOX PROTEIN 02 [H.sapiensl 1.9 

133285 M76477 Hs.289082 GM2 ganglioside activator protein 4.7 

133291 BE297855 Hs.69855 NRAS-related gene 5.0 

133314 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A receptor, pi 2.7 

133321 T79526 Hs.179516 integral type I protein 9.3 

35 133327 AL390127 Hs.7104 KnjppeWike factor 13 4.4 

133347 BE257758 Hs.71475 add duster protein 33 1.8 

133360 AI016521 Hs.71816 v-akt mu rine thymoma viral oncogene homolog 1 5.5 

133366 AA292811 Hs.72050 non-metastatlc cells 5, protein expressed in (nudeoside-dlphosphate kinase) 2.7 

133367 AF231919 Hs.18759 KIAA0539 gene product 1.7 
40 133370 AF245505 Hs.72157 DKFZP564 1 1922 protein 1.8 

133383 8E313555 Hs.7252 K1AA1224 protein 1.7 

133390 A1950382 Hs.72660 phosphatidylserine receptor 13 

133391 AW103364 Hs.727 Inhlbin, beta A (activln A, adivin AB alpha polypeptide) 16.1 
133394 AA305127 Hs.237225 hypothetical protein HT023 12.2 

45 133437 AL031591 Hs.7370 phosphotidylinositol transfer protein, beta 10.4 

133452 NM_002759Hs.274382 protein kinase, Interferon-indudble double stranded RNA dependent 1.2 

133453 AI659306 Hs.73826 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte) 1.7 
133500 AW964804 Hs.74280 hypothetical protein FU22237 11.1 
133529 W45623 Hs.74571 ADP-nbosylatiGn factor 1 2.8 

50 133540 AL037159 Hs.74619 proteasome {prosome. macropain) 26S subunit. non-ATPase, 2 2.9 

133543 AU077073 Hs.108327 damage^spedfic DNA binding protein 1 (127kD) 2.5 

133578 AU077050 Hs.75066 translin 1.5 

133579 X75346 Hs.75074 milogervactivated protein kinase-activated protein kinase 2 2.1 
133582 BE391579 Hs.75087 Fas-activated serine/threonine kinase 1.3 

55 133594 AW160781 Hs.172589 nudear phosphoprotein similar to S. cerevislae PWP1 22 

133595 AA393273 Hs.75133 transcription fador6-like 1 (mitochondrial transcription factor 1-like) 1.5 

133599 NM002885HS.75151 RAP1, GTPase activating protein 1 5.7 

133621 NM_004893Hs.75258 H2A histone family, member Y 25.5 

133627 NMJKJ2047HS.75280 glycyMRNA synthetase 15.8 

60 133631 NM_000401Hs.75334 exostoses (multiple) 2 3.3 

133649 U25849 Hs.75393 acid phosphatase 1, soluble 1.6 

133690 AV661185 Hs.75574 mitochondrial ribosomal protein L19 4.1 

133720 127841 Hs.75737 pericentriolar material 1 1.5 

133722 AW969976 Hs.279009 matrix Gla protein 6.3 

65 133751 AW402048. Hs.334787 Homo sapiens, Simiferto likely ortholog of yeast ARV1, clone IMAGE:3506392, mRNA 3.9 

133757 T52946 Hs.196209 RAE1 (RNA export 1, S.pombe) homolog 1.7 

133760 BE271766 Hs.181357 laminin receptor 1 (67kD, ribosomal protein SA) 1.8 
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1.5 
3.5 



1.8 
2.0 
2.8 
6.7 
2.5 



133765 M62194 Hs.75929 cadherin 11, type 2, OB-cadherin (osteoblast) 

133780 M557660 Hs.76152 decorin *jj 
133784 BE622743 Hs.301064 arfaptin 1 ?•? 
133791 M34338 Hs.76244 spermidine synthase 
5 133797 AL133921 Hs.76272 retlnoblastoma-binding protein 2 ™ 
133822 050525 Hs.699 peptidylproly! Isomerase B (cydophilin B) ™ 
133842 AW797468 Hs.285013 putative human HLA class II associated protein ! J« 
133845 AA147026 Hs76704 ESTs ^ 
133850 W29092 Hs.7678 cellular retinoic add-binding protein 1 
10 133859 U86782 Hs.178761 26S proteasome-associated pad1 homolog 
133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 

133867 AW340125 Hs.76989 K1AA0097 gene product 

133868 AB012193 Hs.183674 cullin 4A 
133881 1)30872 Hs77204 centromere protein F (350MOOkD, mitosin) • « 

15 133922 U30825 Hs.77608 splicing factor, arginine/serme-rich 9 ™ 
133924 D86326 Hs. 325948 vesicle docking protein pi 15 J-J 
133929 NlvL006306Hs.211602 SMC1 (structural maintenance of chromosomes 1, yeast)-uke 1 

133936 L17128 Hs.77719 gamma-glutamyt carboxylase *' 
133941 BE244332 Hs.77770 adaptor-related protein complex 3, mu 2 subunit 
20 133959 X81789 Hs.77897 splicing factor 3a. subunit 3, 60kD *' 
133976 AI908165 Hs.169946 GATA-binding protein 3 (T-ceB receptor gene activator) *J 
133989 AL040328 Hs.78202 SWi/SNF related, matrix associated, actin dependent regulator of chromatin ^ 
133997 AI824113 Hs.78281 regulator of G-protein signalling 12 

134010 AB016092 Hs.197114 RNA binding protein; AT-rich element binding factor « 
25 134015 D31764 Hs.278569 sorting nexin 17 

134070 NM 003590Hs.78946 cullin 3 !*X 
134110 U41060 Hs.79136 UV-1 protein, estrogen regulated ?*f 
134129 NM.014742HS.79305 KIAA0255 gene product " 
134134 H86504 Hs.173328 protein phosphatase 2. regulatory subunit B (B56). epsilon isolorm 



... on 

30 134200 BE559598 Hs.197803 K1AA01 60 protein *f 

134206 AF107463 Hs.79968 splicing factor 30. survival of motor neurorHelated « 

134208 NM_000288Hs.79993 peroxisomal biogenesis factor 7 JJ 

134219 NM 000402Hs .80206 glucose-6-phosphate dehydrogenase J' 

i^^i oMAnA-TQ u-QA//iQ u^mn pinionc r\nr\D iwAnF'!\w.s9^u mRNA. oariiai cos cX> 



1.8 



JOtllS rww_vuuw£.na.uuAw yiu^ « r ~- r -« - — . *^ ... 

134234 BE300078 Hs.80449 Homo sapiens, done IMAGE:3535294, mRNA, partial cds 

35 134275 A1878910 Hs.3688 cisplatin resistance-assodated overexpressed protein 

134292 AI906291 Hs.81234 immunoglobulin superfamBy, member 3 f*J 

134301 AW502505 Hs.81360 Homo sapiens cDNA: FU21927 (is, clone HEP04178, highly similar to HSU90909 2.5 

134305 U61397 Hs.81424 ubiquiWke 1 (sentrin) ~ 

134324 AB029023 Hs.179946 KIAA1100 protein 

40 134326 AW903838 Hs.81800 chondral sulfate proteoglycan 2 (versican) 

134329 N92036 Hs.81848 RAD21 (S. pombe) homolog f° 

134337 NMJ)04922Hs.81964 SEC24 (S. cerevisiae) related gene family, member C " 

134348 AW291946 Hs.82065 interleukin 6 signal transducer (g p1 30, oncostatinM receptor) 

134367 AA339449 Hs.82285 phosphoribosylgtycirBmideformyltransferase, phosphoribosylglycinamide synthetase, o.o 

45 134376 X06560 Hs.82396 2\5'-oligoadenylate synthetase 1 (4046 kD) ™ 

134379 AW362124 Hs.323193 hypothetical protein MGC3222 , UJl/ , COMOOM .... naflia , Jr « R 

134384 AI589941 Hs.8254 Homo sapiens, Similar to tumor dfferenfidy expressed 1 f done IMAG&36392S2, mRNA. partial octo 

134391 AA417383 Hs.82582 lntegrin, beta-like 1 (with EGF4Hce repeat domains) « 

134395 AA456539 Hs.8262 lysosomal ' 

50 134403 AA334551 Hs.82767 sperm specific antigen 2 rz 

134405 AW067903 Hs.82772 collagen, type XI, alpha 1 '~ 

134411 BE272095 Hs.167791 reticulocalbin 1, EF-hand calcium binding domain *« 

134415 A1750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 J* 

134421 AU077198 Hs.82985 collagen, type V, alpha 2 

55 134424 Z44190 Hs.83023 peroxisomal biogenesis factor 11B ^ 

134446 M112036 Hs.83419 KIAA0252 protein ]** 

134447 M58603 Hs.83428 nuclear factor of kappa light polypeptide gene enhancer In B-ceilsl (pi 05) 

134470 X54942 Hs.83758 CDC28 protein kinase 2 £1 

134480 NM_005000Hs.83916 Empirically selected from AFFX single probeset ™ 

60 134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 

134498 AW246273 Hs.84131 threonyl-tRNA synthetase f-' 
134513 AA425473 Hs.84429 KIAA0971 protein 

134516 AK001571 Hs.273357 hypothetical protein HJ 10709 j" 

134520 BE091005 Hs.74861 activated RNA polymerase II transcription cofactoM J./ 

65 134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) fi 



134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor for tRNAs) 
134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 



5.5 
5.8 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



134612 
134624 
134632 
134654 
134664 
134666 
134687 
134692 
134705 
134714 
134719 
134722 
134724 
134746 
134751 
134790 
134806 
134634 
134850 
134853 
134859 
134880 
134910 
134925 
134955 
134971 
134975 
135011 
135022 
135032 
135077 
135083 
135095 
135096 
135153 
135181 
135199 
135207 
135214 
135243 
135245 
135257 
135263 
135274 
135294 
135295 
135307 
135321 
135354 
135361 
135389 
135400 
134975 
135011 
135022 
135032 
135077 
135083 
135095 
135096 
135153 
135181 
135199 
135207 
135214 
135243 
135245 



AW068223 Hs.171581 ublquttin C-terminal hydrolase UCH37 2.2 

AF035119 Hs.8700 deleted in Ever cancer 1 2.0 

X78520 Hs.174139 chloride channel 3 2.3 

AK001741 Hs.8739 hypothetical protein FU10879 1-4 

AA256106 Hs.87507 ESTs ■ 72.9 

BE391929 Hs.8752 transmembrane protein 4 8.5 

U62317 Hs.88251 arytsulfatase A 6.0 

NM_003474Hs.8850 a disintegrin and metaBoprotelnase domain 12 (meltrin alpha) 4.3 

BE1618B7 Hs.88799 anaphase-promoting complex subuntt 10 2.3 

Y14768 Hs.890 lysosomal 6.7 

AA852985 Hs.89232 chromobox homolog 5 (Drosophfla HP1 alpha) 2.3 

AF129536 Hs.284226 F-box only protein 6 2.9 

AF045239 Hs.321576 ring finger protein 22 6.6 

X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell receptor * 2.3 

AW630803 Hs.89497 laminBI 6.2 

BE002798 Hs.287850 Integral membrane protein 1 19 

AD001528 Hs.89718 spermine synthase 1-8 

AW451 370 Hs.B991 adaptor-related protein complex 1 , gamma 2 subunit 1 -4 

AI701162 Hs.90207 hypothetical protein MGC1 11 38 14 

BE268326 Hs.90280 5*rninoimldazQW-<arboxamide ribonucleotide formyltransferase/IMPcydohydrolase 5.6 

D26488 Hs.90315 KIAA0007 protein 2.8 

A1879195 Hs.90606 1 5 kDa selenoprotein 1.7 

AA532963 Hs.9100 Homo sapiens cDNARJ 13100 fis, done NT2RP3002255 1.7 

AW885909 Hs.6975 PRO1073 protein 2.1 

AW401361 Hs.91773 protein phosphatase 2 (formerly 2A), catalytic subunit, alpha Isoform 1.3 

AI097346 Hs.286049 phosphoserine aminotransferase 2.1 

R50333 Hs.92186 Leman coiled-coil protein < 2.3 

AB037835 Hs.92991 KIAA1414 protein 1-6 

NfVL000408Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mitochondrial) 3.9 

AW301984 Hs.173685 hypothetical protein FU1261 9 6.2 

AW503733 Hs.9414 WAA1 488 protein 2.0 

AB036063 Hs.94262 p53-indudble ribonudeotide reductase small subunit 2 homolog 1.3 

AF027219 Hs.9443 zinc finger protein 202 7.1 

AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 3.2 

AI093155 Hs.95420 JM27 protein 2.5 

BE250865 Hs.279529 px19-like proteh 1.4 

AA477514 Hs.96247 transfin-assodated factor X 5.0 

N26427 Hs.9634 ESTs. Highly similar to C10_HUMAN PUTATIVE C10 PROTEIN [H.sapiens] 6.1 

T78802 Hs.96560 hypothetical protein FU 11 656 4.6 

BE463721 Hs.97101 putative G protein-coupled receptor 5.6 

AI028767 Hs.262603 ESTs 3.5 

AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiensJ 1.2 

AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1 2.6 

AA448460 Hs.1 12017 GE36gene 5.3 

AA150320 Hs.9800 protein kinase Njmu-R1 9.1 

AI090838 Hs.98006 ESTs 2.4 

AI743770 Hs.98368 ESTs, WeaMy similar to KIAA0822 protein (H.sapiens) 13.3 

AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD homolog) 2.6 

AA456454 Hs.183418 cell division cycle 2-fike 1 (PITSLRE proteins) 8.3 

AA373452 Hs. 1 67700 Homo sapiens cONAFU 101 74 fis, done HEMBA 1003959 1.5 

U05237 Hs.99872 fetal Alzheimer antigen 4.9 

X78592 Hs.99915 androgen receptor (dihydrotestosterone receptor; testicular feminization; spinal and bulbar 2.0 

R50333 Hs.92186 Leman coiled-coil protein 2-6 

AB037835 Hs.92991 KIM1 41 4 protein 1.4 

NfvL000408Hs.93201 glycerophosphate dehydrogenase 2 (mi 1.6 

AW301984 Hs.173685 hypothetical protein FU12619 1.4 

AW503733 Hs.9414 K1M1 488 protein 1.8 

AB036063 Hs.94262 p53-indudble ribonudeotide reductase s 2.5 

AF027219 Hs.9443 zinc finger protein 202 1.5 

AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 2.1 

AI093155 Hs.95420 JM27 protein 4.4 

BE250865 Hs.279529 px19-Gke protein 14.9 

AA477514 Hs.96247 transfin-assodated factor X 1.3 

N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATl 1.7 

T78802 Hs.96560 hypothetical protein RJ 11 656 6.1 

BE463721 Hs.97101 putative G protein-coupled receptor 2.7 

AI028767 Hs.262603 ESTs 12.2 
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135257 AW291023 Hs.97255 ESTs, Weakly similar to A46Q10 X-linked ™ 

135263 A1086775 Hs.55498 geranylgeranyl diphosphate synthase 1 J* 

135274 AA448460 Hs.112017 GE36gene J-J 

135294 AA150320 Hs.9800 protein kinase Njmu-R1 ]* 

5 135295 AI090838 Hs.98006 ESTs ?° 

135307 AI743770 Hs.98368 ESTs, Weakly similar to K1AA0822 protein J* 

135321 A1652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho J" 

135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr 

135361 AA373452 Hs.167700 Homo sapiens cDNA FLM0174 fis, done HE 

10 135389 U05237 Hs.99872 fetal Alzheimer anflgen ]- y 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterane r ™* y 

302256 AA857131 Hs.171595 HIV TAT specific factor 1 ^ 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 

303135 AW592789 Hs.279474 HSPC070 protein J-J 

15 303686 AK000714 Hs.109441 MSTP033 protein _ J* 

Z8 



310085 R43191 Hs.101248 Homo sapiens clone IMAGE:32553, mRNA seq 

315518 AA808229 Hs.167771 ESTs 

317781 NNL007057HS.42650 ZW10 Interactor f* 

320836 AI268997 Hs.197289 rab3 GTPase-activating protein, non-cata 

20 321114 AA902256 Hs.78979 Golgt apparatus protein 1 J* 

322221 N24236 Hs,179662 nucleosome assembly protein 1-Oke 1 J-f 

322474 AF1 18083 Hs.29494 PR01912 protein 

322556 BE041451 Hs.177507 hypothetical protein * 

323541 AF292100 Hs.104613 RP42homolog 

25 407827 BE278431 Hs.40323 BU83 (budding uninhibited by benzimidazo 

408196 AL034548 Hs.43627 SRY (sex determining region Y)-box 22 

408813 AI580090 Hs.48295 RNA helicase family 

409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti « 



413670 AB000115 Hs.75470 hypothetical protein, expressed in osteo 



414846 AW304454 Hs.77495 UBX domain-containing 1 
416980 AA381133 Hs.80684 high-mobility group (nonhistone chromoso 



1.6 
6.1 
5.6 



1.5 



30 414108 AI267592 Hs.75761 SFRS protein kinase 1 



23.6 



417373 R57256 Hs.82037 TATA box binding protein (TBP)-assodate 5.8 
418283 S79895 Hs.83942 calhepsin K (pycnodysostosis) 

35 418467 NM_006910Hs.85273 retinoblastoma-binding protein 6 ™ 

420269 U72937 Hs.96264 alpha thalassemia/mental retardation syn " 

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen ™ 
421225 AA463798 Hs.102696 MCT-1 protein 

421642 AF172066 Hs.106346 reUnoic acid repressible protein ™ 

40 421828 AW891965 Hs.279789 histone deacetylase 3 *J 

421983 AI252640 Hs.110364 peptidylprolyl isomerase C (cyckiphilin ™ 
422052 AA302744 Hs.104518 ESTs 
422055 NMJ)14320Hs.1 11029 putative heme-bindlng protein 

423750 AF165883 Hs.298229 prefoldin2 ( " 

45 424001 W67883 Hs.1 37476 paternally expressed 10 (PEG10; WAA105 ™ 

425182 AF041259 Hs,155040 zinc finger protein 217 £i 
425284 AF155568 Hs.155489 NS1 -associated protein 1 

426372 BE3O4680 Hs.169531 DEAD/H (Asp-Glu-Ala-Asp/His) box poiypep J* 

428049 AW183765 Hs.182238 GW128 protein J' 

5 0 428477 AW500533 Hs.1 1482 splicing factor, arginine/serine-rich 1 1 *■* 

437562 AB001636 Hs.5683 DEAD/H (Asp-Glu-Ala-AspJHis) box poiypep ™ 

438449 AK001333 Hs.6216 Homo sapiens hepatocellular carcincma-as ™ 

441560 F13386 Hs.7888 Homo sapiens clone 23736 mRNA sequence tz 

445580 AF167572 Hs.12912 skbl (S. pombe) homolog '* 

55 446999 AA151520 Hs.334822 hypothetical protein MGC4485 " 
447111 A1017574 Hs.17409 cysteine-rich protein 1 (intestinal) 
447778 BE620592 Hs.71190 ESTs, Weakly similar to S16506 hypotheti 

448873 NM_003677Hs.22393 density-regulated protein J-J 

449687 W68520 Hs.331328 Intermediate filament protein syncoilin J* 

60 450701 H39960 Hs.288467 Homo sapiens cDNAFU 12280 fis, done MA J-« 
450703 AA011202 Hs.184771 nuclear factor l/C (CCAAT-binding transc 
452461 N78223 Hs.108106 transcription factor 

452511 BE408178 Hs.285165 Homo sapiens cDNA FU20845 fis, clone AD J* 1 

453157 AF077036 Hs.31989 DKFZP586G1722 protein JJ 

65 453658 BE541906 Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp « 

100833 AF135168 Hs. 108802 Nnethylmaleimide-sensitive factor ** 

102481 U50360 gb:Human calcium, calmodulin-dependent p °* 
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102827 BE244588 Hs.6456 chaperonin containing TCP1,subunlt 2 (b 7.9 

103549 BE270465 HsJ8793 protein kinase C, zeta 2.0 

104331 AB040450 Hs.279862 cdk Inhibitor p21 binding protein 5.3 

110018 AW579842 Hs.104557 hypothetical protein FU 10697 2.0 

5 115008 AK001827 Hs.87889 helicase-moi 5.7 

119075 M10905 Hs.287820 fibronectinl 1.3 . 

119615 AL034423 Hs.75875 ublquitin-conjugating enzyme E2 variant 2.9 

125006 BE065136 Hs.145696 spfidng factor (CC1 .3) 1.7 

127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture 2.4 

10 129209 R62676 Hs.17820 Rho-associated, cc8ed-coil containing p 5.2 

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 4.5 

130182 BE267033 Hs.192853 ubiquitin-conjugafing enzyme E2G 2 (homo 11.0 

130365 W56119 Hs.155103 eukaryotic translation mitiation factor 3.3 

131135 NMJ16569HS.267182 TBX3-iso protein 1.3 

15 131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1 JIUMAN IROQU 3.2 

131881 AW361018 Hs.3383 upstream regulatory element binding prot 14.3 

132726 N52298 Hs.55608 hypothetical protein MGC955 3.0 

135193 X95525 Hs.96103 TATA box binding protein (TBP)-associate 2.7 

409487 H19886 gb:yn57a05.r1 Scares adult brain N2b5H 2.3 

20 416040 AW819158 Hs.289044 Homo sapiens cDNA FU12048 to, done HE 7.4 
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TABLE 4A 



Table 4A shows the accession numbers for those pkeys lacking unigenelD's for Table 4. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



123615 
123619 
101445 
124385 
124417 
124482 
102481 
103349 
110856 



120280 
113248 
120472 



30686.-15 

371681 1 

1650.-5 

656394.1 

1642364.1 

1657509.1 

3128U-28 

11052.-2 

19346.14 . 



160212_i 
328626.1 
44573 2 



129019 44573.2 



120695 9683.3 



M609170 

AA602964AA609200 
M21259 

AI267847 N27351 
N34059 N46979 
N53935 N53950 
U50360 
X89059 

AA992380 N33063 N21418 H79958 R21911 H79957 

103797 109699 1 AA080912 M075318 AA083403 AA076594 AA078992 AA084926 AA081881 AA1 13913 AA1 13892 
AA083821 AA134801AA082953 AA070343 AA062835 AA075419 AA063293 M071252 AA078900 AA062836 AW974305 

AA190577AA181657 

T63857 AW971220 AA493469 T63699 

AI950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068AL 134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832 
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
8E612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 AA251669 AA251874 
AI819225 AW205862 AI683336 AI858509 AW276905 A1633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 A1022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
A1950087 N70208 R97040 N36809 AI3031 19 AW967677 N35320 A1251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 A1560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA263144 AI890387 AI950344 AI741346 AI669062 AA282915 AW102898 AI672193 AI763273 AW1 73586 AW150329 AI653832 
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW981 101 AA251669 AA251874 
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 A1022058 AA780419 AA551005 W80701 AW613456 AI373032 A1564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
AA976503 A1917802AA953664AA404613 AA428771 BE280542 AW194691 AI927301 Ai740458 A1796100 AI935603 AW052210 
AA970201 AI633384 AA425910 AI017004 A1241295 AA402816 AA291468 
AA398838 M435847 

AA416568 AA442889 AA417233 AA442223 
AA453641 AA454061 



122188 275673J 
121581 283769.1 
122518 305217.1 
109026 150431 1 AA157811 AA836869 
123658 genbajik_M609364 AA609354 
123811 genbank_AA62G586 AA620586 
125115 genbank.T97341 T97341 
125147 NOT_FOUND.entrez_W38150 
118737 382979 1 AA1 99686 N73861 
120274 genbanLAA177051 AA177051 
113196 genbank_T57317 T57317 
120504 genbanleAA256837 AA256837 
120639 genbank^AA285942 
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120809 genbanlUA346495 AA346495 

113702 genbankJT97307 T97307 

129680 23162J U03749 NMJ01275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 A1741505 AA843875 AI829382 A1560122 
AI858999 D55958 A1684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
5 AW043898 AJ969102 AA405741 AI091983 M788784 AA706586 AA854361 AW470949 AA843095 AA772028 AJ148432 

AKJ38109 AA782478 AA910064 AI22Q384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 AA894441 
AJ803081 AJ167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170T27943 AA889304 AA976699 A1687001 
A1621107 AI865540 AA772107 C05286 AA319661 M405992 
10 101045 entre*J05614J05614 

117247 genbank N21032 

110501 genbank H55748 

103392 entrezJ(94563X94563 

105032 genbank AA127818 
15 119513 NOTJWNO.entrez W37933 

105445 genbank AA252395 

121514 genbank AA412112 

121558 genbank AA412497 

121911 genbank AA427950 
20 123315 714071J AA496369 AA496846 

114911 genbank AA236672 

409487 1134778 J H 1 9886 AW402808T1 0231 
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TABLE 5: Figure 5 from BRCA 001 US 
Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

10 Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD UnlgeneTitle 



Rl 



15 

100114 X02308 Hs.82962 
100147 D13666 Hs.136348 
100154 H60720 Hs.81892 
100335 AW247529 Hs.6793 
20 100656 L05424 Hs.169610 

100667 105424 Hs.169610 

100668 L05424 Hs.169610 
100678 AW502935 Hs.740 
100988 AK000405 Hs,76480 

25 101031 J05070 Hs.151738 
101045 J05614 
101332 J04088 Hs.156346 
101352 AM94299 Hs.16297 
101580 NMJ)1215lHs.83363 

30 101592 AF064B53 Hs.91299 
101767 M81057 Hs.180884 
101806 AA586894 Hs.112408 
101810 NH.000318HS.180612 
101983 AJ904232 Hs.75323 

35 102107 BE258602 Hs.182366 
102165 BE31328Q Hs.159627 
102198 AW950852 Hs.74598 
102217 AA829978 Hs.301613 
102220 U24389 Hs.65436 

40 102302 AA306342 Hs.69171 
102348 U37519 Hs.87539 
102374 U33635 Hs,90572 
102455 U48705 Hs.75562 
102568 W81489 Hs.223025 

45 102618 AL037672 Hs.81071 
102687 NM_007019Hs.93002 
102689 U96132 Hs.171280 

102704 AU077058 Hs.54G89 

102705 T97490 Hs.50002 
50 102801 BE252241 Hs.38041 

102827 BE244588 Hs.6456 
103060 NMJJ05940HS.155324 
103080 AU077231 Hs.82932 
103178 AA205475 Hs.275865 

55 103206 X72755 Hs.77367 
103238 A1369285 Hs.75189 
103547 AI376722 Hs.180062 
103549 BE270465 Hs.78793 
103886 AK001278 Hs.105737 

60 104325 BE379766 Hs.150675 
104827 AW052006 Hs.8551 
104846 A1250789 Hs.32478 
104854 AA041276 Hs.154729 
104867 AA278898 Hs.225979 



thymidylate synthetase 2.9 

osteoblast specific factor 2 (fasciclin 7.5 

KIM0101 gene product 9 2 

platelet-activating factor acety&iydrola 2.7 

CD44 antigen (homing function and Indian 5.7 

CD44 antigen (homing function and Indian 9 

CD44 antigen (horning function and Indian 7.6 

PTK2 protein tyrosine kinase 2 53.2 

ubiquitin-like4 11-4 

matrix metafloproteinase 9 (gelatinase 8 8.2 

gb:Human proliferating cell nuclear anil 5 

topoisomerase (DNA) 11 alpha (170kD) 3.4 

COX17 (yeast) homolog, cytochrome coxid 6.3 

coagulation factor Vlll-associated (intr 5.7 

guanine nucleotide binding protein ( 5.6 

carboxypeptidase B1 (tissue) 14.4 

S100 calcium-binding protein A7 (psorias 8.9 

peroxisomal membrane protein 3 (35kD, Ze 3.2 

prohibits M 

heat shock protein 75 1.4 

death associated protein 3 4.6 

polymerase (DNA directed), delta 2, regu 4.3 

JTV1 gene 6,7 

lysosomal 4.3 

protein kinase C-tike 2 2.7 

aldehyde dehydrogenase 3 family, member 2 

PTK7 protein tyrosine kinase 7 6.2 

discoldin domain receptor family, member 6.9 

RAB31, member RAS oncogene family 5.3 

extracellular matrix protein 1 5.8 

ubiquitin carrier protein E2-C 4.3 
hydroxyacyl-Coenzyme A dehydrogenase, ty 6 

BRCA1 associated RING domain 1 1 .9 

small inducible cytokine subfamily A (Cy 2.3 

pyridoxal (pyridoxine, vitamin B6) Wnas 6.4 

chaperonin containing TCPl.subunit 2 (b 5.6 

matrix metalloproteinase 1 1 (MMP11; stro 4.5 

cydin D1 (PRAD1: parathyroid adenomatos 3.1 

ribosomal protein S18 9-9 

monokine induced by gamma Interferon 8.8 

death-associated protein 5-6 

proteasome (prosome, macropain) subunit, 9.7 

protein kinase C, zeta 7.9 

hypothetical protein RJ10416 slmilarto 6.5 

polymerase (RNA) II (DNA directed) polyp 6.3 
PRP4/STIWD splicing factor 105 

ESTs 5-6 
3-phospholnositide dependent protein kin 123 
hypothetical protein similar to small G 2 
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104896 AW015318 Hs.23155 
104909 AW408164 Hs.249184 
104916 AW958157 Hs.155489 
104919 AA026880 Hs.25252 
5 104974 Y12059 Hs.278675 
104978 AI199268 Hs.19322 
105012 AF098158 Hs.9329 
105039 AA907305 Hs.36475 
105079 AA151342 Hs.12677 

10 105088 H58589 Hs.35156 
105393 AF167570 Hs.2565S3 
105547 AA262640 Hs.27445 
105564 BE616694 Hs.288042 
105658 AA985190 Hs.246875 

15 105746 AW151952 Hs.46679 
105858 AF151066 Hs.281420 
105930 AF016371 Hs.9880 
106094 AA533491 Hs.23317 
106350 AK001404 Hs.194698 

20 106359 AW390282 Hs.31130 
106610 AA458882 Hs.79732 
106624 NMJ)03595Hs.2635O 
106713 BE614802 Hs.184352 
106829 AW959893 Hs.27099 

25 106846 AB037744 Hs.34892 
106873 N49809 Hs.11197 
106973 BE156256 Hs.11923 
107029 AF264750 Hs.288971 
107197 W15477 Hs.64639 

30 107859 AW732573 Hs.47584 
107901 L42612 Hs.335952 
107922 BE153855 Hs.61460 
107974 AW956103 Hs.61712 
108274 AF129535 Hs.272027 

35 108647 BE546947 Hs.44276 
108695 AB029000 Hs.70823 
108894 AK001431 Hs.5105 
109011 AA156542 Hs.72127 
109068 AA164293 Hs.72545 

40 109273 AA375752 Hs.82719 
109468 NM_015310Hs.6763 
110240 AI668594 Hs.176588 
110330 AI288666 Hs.16621 
110501 H55748 

45 110742 AW190338 Hs.26029 
110762 BE044245 Hs.30011 
110856 AA992380 
110958 NJVL005864HS.24587 
111125 N63823 Hs.269115 

50 111179 AK000136 Hs.10760 
111239 N90956 Hs.17230 
111285 AA778711 Hs.4310 
111392 W46342 Hs.325081 
111937 BE298665 Hs.14846 

55 112244 AB029QQ0 Hs.70823 
112995 AA737033 Hs.7155 
113777 BE266947 Hs.10590 
113791 AI269096 Hs.135578 
113811 BE207480 Hs.6994 

60 113834 T26483 Hs.6059 
113868 W57902 Hs.90744 
113870 AL079314 Hs.16537 
113923 AW953484 Hs.3849 
114275 AW515443 Hs.306117 

65 114895 AA236177 Hs.76591 
114965 AI733881 Hs.72472 
115061 AI751438 Hs.41271 



ESTs 

transcripfian factor 19 (SC1) 
NS1 -associated protein 1 
prolactin receptor 
bromodomaln-contalntng 4 



177 

5 

1.7 
1.4 
1.4 



Homo sapiere,SimflartoRiKEN<f>NA 2010 11 

chromosome 20 open reading frame 1 3.3 

ESTs 2.5 

CGM47 protein 9.5 

Homo sapiens cONA FU1 1027 fis, clone PL U 

interieukin enhancer binding factor 3, 9 5.4 

unknown 9.3 

hypothetical protein RJ14299 1.4 

hypothetical protein FU20059 9.4 

hypothetical protein FU20739 1 -5 

hypothetical protein 2.9 

peptidyf prolyl Isomerase H (cydophilin 5.2 

hypothetical protein FU14681 6.8 

cycfin B2 5.7 

transmembrane 7 superfamHy member 2 6.3 

fibulin 1 7.9 

tyrosytprotein sulfotransferase 2 7.7 

rrypothetical protein FU12549 4.5 

hypothetical protein FU23293 similar to 162 

KIAA1323 protein 2.2 

Homo sapiens, clone 1MAGE3343149, mRNA, 16.8 

hypothetical protein 6.6 

myeloid/lymphoid or mixed-lineage leukem 1 .8 

glioma pathogenesis-related protein 6. 1 

. . , . . 64 

2.5 
2.2 
6.7 



72 
4 

1.4 
2.9 



keratin 68 
Ig superfamfly receptor LNtR 



F-box only protein 5 7.1 
homeoboxCIO 9.8 
KIAA1077 protein 
hypothetical protein FU10569 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9 
KIAA0942 protein 3.2 
ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 
DKFZP434I116 protein 62 
gbyq94a01 .s1 Soares fetal liver spleen 6.1 
hypothetical protein MGC11256 7.6 
hypofteifeal protein MGC2963 9.3 
gb:ot37g06.s1 SoaresJesfc_NHT Homo sap 2.3 
signal transduction protein (SH3 contain 6.7 
ESTs. Moderately similar to Z195.HUMANZ 3.6 
asporin(LRR class 1) 7.1 
hypothetical protein FU22087 7.9 
eukaryoflc translation initiation factor 6.9 
Homo sapiens, done IMAGE:3659680, mRNA. 8.4 
Homo sapiens mRNA; cONA DKFZp564D016 (fr 10.6 
K1AA1077 protein 14.6 
ESTs. Moderately similar to 21 15357A TYK 5.6 
zinc finger protein 313 1 3.4 

chitobiase, di-N-acetyl- 1-3 
Homo sapiens cONA: FU22044 fis, done H 3.1 
EGF-containing ftbulin-Dke extracellula 1 1 .3 
proteasome (prosome, macropain) 26S subu 2.7 
hypothetical protein, similar to (U06944 6.1 
hypothetical protein FU22041 similar to 1.9 
K1AA0306 protein 15.8 
KIAA0887 protein 7.1 
BMP-R1B 2.3 
Homo sapiens mRNA full length Insert cON 1 1 .8 
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115278 
115291 
115652 
115693 
115941 
115968 
116011 
116417 
116470 
116637 
117132 
117881 
118528 
119075 
119265 
119349 
119403 
119789 
120206 
120253 
120297 
120325 
120327 
120349 
120356 
120371 
120383 
120386 
120389 
120396 
120418 
120472 
129484 
120570 
120582 
120596 
120624 
120695 
120713 
120750 
120774 
120807 
120809 
120984 
121081 
121408 
121505 
121508 
121513 
121549 
121558 
121655 
121744 
121748 
121773 
121832 
121839 
121882 
121911 
121999 
122013 
122036 
122356 
122371 
122372 
122460 
122490 



AK002163 
BE545072 
BE093589 
AF231023 
AI867451 
AB037753 
AL359053 
AW499664 
AI272141 
AK001043 
AI393666 
AF161470 
A1949952 
M10905 
BE539706 
T65004 
AL1 17554 
BE393948 
H26735 
AA131376 
AA191384 
AA195651 
AK000292 
AW969481 
AF000545 
AA219305 
AL109963 
AW969665 
AW967985 
AA134006 
AW966893 
AI950087 
AA253170 
AA280679 
BE244830 
AA282074 
AW407987 
AA976503 
AW449855 
AI191410 



Hs.301724 
Hs.122579 
Hs.38178 
Hs.55173 
Hs.46679 
Hs.62767 
Hs.57664 
Hs.12484 
Hs.83484 
Hs.92033 
Hs.42315 
Hs.260622 
Hs.49397 
Hs.287820 
Hs.265363 
Hs.163561 
Hs.1 19908 
Hs.50915 
Hs.91668 
Hs.326401 
Hs.104072 
Hs.104106 
Hs.278732 
Hs.55189 
Hs.296433 
Hs.104196 
Hs.123122 
Hs.154848 
Hs.325572 
Hs.79306 
Hs.26613 



hypothetical protein FU11301 1 .5 

hypothetical protein FU10461 6.2 
hypothetical protein FU23468 10.6 
cadherin, EGF LAG seven-pass G-type rece 6.8 
hypothetical protein FU20739 5.5 
KIAA1332 protein 

Homo sapiens mRNA full length insert cDN 
Human done 23826 mRNA sequence 
SRY (sex determining region Y>box 4 
integrin-iinked Wnase-associated serine 
p10-bindlng protein 
butyrate-induced transcript 1 
ESTs 

fioronectln 1 
ESTs 
ESTs 

nudeoiar protein NOP5/NOP58 
kailikrein 5 (KLK5; KLK-L2; stratum com 
Homo sapiens clone PP1498 unknown mRNA 45.7 
fibroblastgrowth factor 12B 38.9 
ESTs.Weaklys1milartoZ195^HUMANZlNC 15.2 

mt. R A 



Hs.96473 

Hs.271445 

Hs.284228 

Hs.237323 

Hs.173518 



M346385 
AA346495 
BE262951 
AA398721 
AA406137 
AA494172 
AA402515 
AA416653 
AA412477 
AA412497 
AA421537 
AA398784 
BE536911 
AB033022 
AW340797 
AA425691 
AA426376 
M427950 
AA430211 
AA431085 
W92142 
AA443794 
AA868555 
AA446008 
AW418788 
AA448349 



2.4 
7,4 
2.1 
2.7 
5.2 
5.7 
7.4 
5.7 
1.4 
8.4 
6.7 
9.2 



ESTs 

hypothetical protein FU20285 



Hs.96557 
Hs.96693 
Hs.193985 
Hs.30002 

Hs.99052 
Hs.186749 
Hs.98019 
Hs.194417 
Hs.97887 
Hs.181510 
Hs.98142 

Hs.178072 
Hs.97514 
Hs.234545 
Hs.158654 
Hs.98434 
Hs.191606 
Hs.98459 

Hs.98668 
Hs.98706 
Hs.271963 
Hs,98390 
Hs.178222 
Hs.336677 
Hs.99148 
Hs.238151 



putative purinergic receptor 
EST 



6.4 
16.1 
16.8 
28.1 
12.4 
9.7 
32.6 



FSH primary response (LRPR1, rat) homolo 
hypothetical protein DKFZp434D0127 
ESTs, Moderately similar to ALU7_HUMAN A 21.7 
eukaryofic translation initiation factor 1 23 
Homo sapiens mRNA; cDNA DKFZp586F1323 (f 1 1 .4 
g bwq05c02jc1NCLCGAP^Kid12Homosapien 19.4 
EST 104 
ESTs, Weakly similar to ALU1_HUMAN ALU S 14.4 
ZNF135-like protein 105 
N-acetytglucosamine-phosphate mutase 7.5 
M-phase phosphoprotein homotog 52 
gb:oq30a04.s1 NCLCGAP_GC4 Homo sapiens 46.8 
Homo sapiens cDNA FU 12727 fis, clone NT 5.9 
ESTs, Moderately similar to 2109260A Be 7 
ESTs 7.8 
SH3^containing protein SH3GLB2;KIAA1848 6.8 
gb:EST52657 Fetal heart II Homo sapiens 4.4 
ESTs 

ESTs, Highly similar to 137550 mismatch 
EST 
ESTs 
ESTs 
ESTs 
EST 

gb:zt95g12.s1 SoaresJestisJJHT Homo sap 2.8 
Homo sapiens mRNA; cDNA DKFZp434B1023 (f 7.8 
ESTs 1 
hypothetical protein NUF2R 
WAA1196 protein 
ESTs 

ESTs, Highly similar to WAA1048 protein 
ESTs 

gb:zw50f02^1 Soares_totalJetus^Nb2HF8_ 7.2 

ESTs 65 
ESTs, Weakly similar to ALU5.HUMAN ALUS 13.1 
ESTs 7 - 3 

ESTs 7fi 
EST 7.6 

ESTs, WeaWy similar to S43569 R01H10.6 9.7 

EST 6.1 
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5.6 
5.4 
6 

13.1 
28 
6.2 
7.4 



7.1 

19.5 

7.9 

5.8 

5 
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122492 
122510 
122530 
122572 
122607 
122614 
122616 
122618 
122622 
122717 
122829 
122838 
122856 
122868 
122907 
123016 
123034 
123136 
123152 
123394 
123466 
123486 
123615 
123735 
123753 
124006 
124385 
124440 
124656 
124683 
124735 
124761 
124768 
124768 
124811 
124812 
124822 
124860 
124903 
124930 
124942 
125051 
125056 
125101 
125115 
125280 
127274 
128528 
128670 
128691 
128772 
128781 
128797 
128868 
128891 
128946 
128975 
128995 
129019 
129076 
129088 
129096 
129198 
129347 
129362 
129372 
129404 



AA448417 Hs.104990 
AA449232 Hs.99195 
AW959741 Hs.40368 
AA452601 Hs.99287 
AA453518 Hs.98023 
AA453630 Hs.99339 
AA453638 Hs.161873 
AA453641 

AA453987 Hs.144802 
AA456859 Hs.178358 
AW204530 Hs.99500 
AA460584 Hs.334386 
AI929374 Hs,75367 
AF005216 Hs.115541 
AA470074 Hs.169896 
AW338067 Hs.323231 
AI359571 Hs.44054 
AW451999 Hs.194024 
AW601773 Hs,270259 
AA731404 Hs.105510 
AA599042 Hs.112503 
BE019072 Hs.334802 
AA609170 

NMJ)13241Hs.95231 
AA609955 Hs.234961 
AI147155 Hs.270016 
AI267847 

AA532519 Hs.129043 
AW297702 Hs.102915 
AA381661 Hs.1 19878 
R22952 Hs.268685 
AA374756 Hs.93560 
AW368528 Hs.100855 
R43543 Hs.100912 
R46068 Hs.288912 
R47948 Hs.188732 
AA418160 Hs.86043 
R65763 Hs.101477 
AW298713 Hs.221441 
AI076343 Hs.173939 



R99978 

T79956 

T81310 

AI472068 

T97341 

AI123705 



Hs.268892 
Hs.100588 
Hs.100592 
Hs.286236 

Hs.106932 
AW966158 Hs.58582 
R39234 Hs.251699 
AA975486 Hs.103441 
W27939 Hs.103834 
BE302796 Hs.105097 
N71826 Hs.105465 
NIVL002975H3.105927 
AA419008 Hs.106730 
F34856 Hs.292457 
Y13153 Hs.107318 
BE560779 Hs.284233 
AI816224 Hs.107747 
AI95Q087 

AW296806 Hs.326234 
AA744610 Hs.194431 
AA463189 Hs.288906 
N57532 Hs.109315 
BE614192 Hs.279869 
U30246 Hs.110736 
NMJJ16039HS.1 10803 
AI267700 Hs.317584 



ESTs 5.4 

ESTs 11.2 

adaptor-related protein complex 1 , sigma 10.1 

EST 11 

ESTs 61.5 

EST 10.7 

ESTs 107.3 
gb:zx48e06.s1 Soares_testis_NHT Homo sap 31.1 

ESTs 5.6 

ESTs 8.5 

ESTs 81.8 

ESTs 75.3 

Src4ikeedapter 5.8 

Janus kinase 2 (a protein tyrosine kinas 5.3 

ESTs 11.5 

Homo sapiens cONA FU1 1946 fis, done HE 2.8 

nineln {GSK3B Interacting protein) 8.7 

ESTs 5.1 

ESTs 5.2 

ESTs 3.6 

EST 7.4 

Homo sapiens cDNA FU 14680 fis, done NT 2.4 

gb:af12a12.s1 SoaresJestis_NHT Homo sap 7.8 

FH1/FH2 domain-containing protein 10 

Huntingtin interacting protein E 30.6 

ESTs 8.1 

gbaq49a10,x1 Stanley Frontal NB pool 2 57.1 
Human DNA sequence from done 989H1 1 on 7.8 

ESTs 8.3 
ESTs, Weakly similar to M3K9_HUMAN MfTOG 7.9 

ESTs 11.3 

Homo sapiens mRNA for KIAA1771 protein, 9 

ESTs 8.1 

Homo sapiens cDNA: FU22726 fis, done H 5.1 

hypothetical protein FU22604 1 4.2 

ESTs 7.9 

Homo sapiens cDNA FU 13558 fis, done PL 6.6 

EST 23.9 

ESTs 32.4 

ESTs, Weakly similar to ALUB_HUMAN 111! 22.8 

ESTs, Moderately similar to B34087 hypot 6.1 

EST 135.3 

ESTs 5.4 

K1AA1856 protein 5.6 

gb:ye57e05.s 1 Scares fetal Over spleen 9.6 

ESTs 8 

Homo sapiens cDNA FU12789 fis, done NT 12.8 

ESTs, WeaWy similar to DN4-GGTR14 |H.s 2 A 

Homo sapiens, Similar to RIKENcDNA 1700 7.1 

hypothetical protein MGC5576 7.7 

thymidine kinase 1 , soluble 5.3 

small nuclear ribonudeoprotein polypept 53.9 

stem cell growth factor lymphocyte seer 1 3^ 

chromosome 22 open reading frame 3 3 
Homo sapiens, done MGC: 16362, mRNA. com 13.3 

kynurenine 3-monooxygenase (kynurenlne 3 7.2 

NICE-5 protein 14 

OKFZP566C243 protein 1.9 
gbwq05c02jt1 NCI_CGAP_Kld12 Homo saplen 2.9 

ESTs. Highly similar to T46422 hypothetl 5 

paBadin 17.1 

WW Oomain-ContalnJng Gene 20:9 

WAA1415 protein 5.8 

melanoma-associated anOgen recognised b 7.6 

solute carrier family 1 2 (sodium/potass! 6.7 

CGI-99 protein 2 

ESTs 5 
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129482 M188185 Hs.289043 
129559 W01296 Hs.11360 
129587 H14718 Hs.11506 
129629 AK000398 Hs.11747 
5 129649 AO000092 Hs.16488 
129680 U03749 
129689 AW748482 Hs.77873 
129702 AI304966 Hs.12035 
129720 AA156214 Hs.12152 

10 130010 AA301116 Hs.142838 
130097 AL046962 Hs.14845 
130135 AA311426 Hs.21635 
130211 N!vL003358Hs.237O3 
130242 X79201 Hs.153221 

15 130359 NM_013449Hs.277401 
130365 W56119 Hs.155103 
130448 BE513202 Hs.15589 
130455 D90041 Hs.155956 
130471 AL121438 Hs.183706 

20 130503 BE208491 Hs.295112 
130511 L32137 Hs.1584 
130542 U64675 Hs.179825 
130553 AF062649 Hs.252587 
130556 AI907018 Hs.15977 

25 130567 AA383092 Hs.1608 
130574 AF083208 Hs.16178 
130617 M90516 Hs.1674 
130667 BE246961 Hs.17639 
130693 R68537 Hs.17962 

30 130744 H59696 Hs.18747 
130757 AL036067 Hs.18925 
130880 BE514434 Hs.20830 
130944 BE382657 Hs.21486 
131046 AA321649 Hs.2248 

35 131060 AA194422 Hs.22564 
131099 AL133353 Hs.226581 
131135 NMJ)16569Hs.267182 
131185 BE280074 Hs.23960 
131225 H62087 Hs.31659 

40 131245 AL080080 Hs.24766 
131283 X80038 Hs.339713 
131569 AL389951 Hs.271623 
131643 AW410601 Hs.30026 
131714 AA642831 Hs.31016 

45 131722 013757 Hs.311 
131760 X76732 Hs.3164 
131793 AW966127 Hs.32246 
131885 BE502341 Hs.3402 
131900 AA099014 Hs.231029 

50 131905 AA179298 Hs.3439 
131941 BE252983 Hs.35086 
131971 BE567100 Hs.154938 
132180 NWL004460HS.418 
132203 NM_004782Hs.194714 

55 132273 AA227710 Hs.43658 
132288 N36110 Hs.305971 
132294 AB023191 Hs.44131 
132348 AW067708 Hs.170311 
132370 AW572805 Hs.46645 

60 132384 AA312135 Hs.46967 
132450 AA100012 Hs.48827 
132465 AW169847 Hs.49169 
132532 AA454132 Hs.5080 
132574 AW631437 Hs,5184 

65 132638 A1796870 Hs.54277 
132718 NM_004600Hs.554 
132726 N52298 Hs.55608 



spindlin 67 

hypothetical protein FU14784 7,5 

Human done 23589 mRNA sequence 6.8 

hypothetical protein FU20391 3.8 

calreflcutin 3.3 
gb:Human chromogranin A (CHGA) gene, pro 14.1 

B7 homotog 3 2.6 

ESTs, Weakly similar to 138022 hypotheti 7.4 

APMCF1 protein 2 

nucleolar phosphoprotein Nopp34 1 .6 

fon\headbox03A 2.8 

tubulin, gamma 1 6.1 
ESTs, Moderately similar to CEGT_HUMAN C1.6 

synovial sarcoma, translocated to X chro 5.4 

bromodomain adjacent to zinc finger doma 8.5 

eukaryotic translation initiation factor 1 1 

PPAR binding protein 3.9 

N-acetyttransferase 1 (arylamine N-acety 33.6 

adducin 1 (alpha) 2.7 

KIAA0618 gene product 16.1 

cartilage ofigomeric matrix protein (pse 6.1 

RAN binding protein 2-tike 1 7.8 

pituitary tumor-transforming 1 14.4 

Empirically selected from AFFX single pr 4.7 

replication protein A3 (14kD) 7.9 

apoptosis antagonizing transcripfion fac 1 .2 

glutamine-fructose-6-phosphate transamin 1 2.1 
Homo sapiens ubiquitin protein ligase {U 13.9 

ESTs 2 

POP7 (processing of precursor, S. cerevi 3.1 

protein x 0001 57 

kinesin-like2 2.1 

signal transducer and activator of trans 5.4 
small inducible cytokine subfamily B (Cy 7.4 
myosin VI 5.1 
COX15 (yeast) homotog, cytochrome coxid 7 
TBX3-iso protein 3.3 
cyclin B1 5.8 
thyroid hormone receptor-associated prot 7.5 
thloredoxin domain-containing 2.8 
Homo sapiens clone F19374APOE-C2 gene 1.3 
nucteoporin 50kD 5 
HSPC182 protein 2.9 
putative DNA binding protein 2.9 
phosphoribosyl pyrophosphate amidotransf 3.4 
nucteobindin 2 2.9 
Homo sapiens cDNA FLJ14656 fis, done NT 7.9 
ESTs 137 
Homo sapiens, done MGC:15961, mRNA, com 8.7 



stomatMike 2 
ubiquitin specific protease 1 
hypothetical protein MDS025 
fibroblast activation protein, alpha 
synaptosomat-associated protein, 29kD 
DKFZP586L151 protein 
solute carrier family 2 (facilitated gfu 
KIAA0974 protein 



ESTs 

HSPC034 protein 
hypothetical protein FLJ12085 
KIAA1 634 protein 

mitochondrial ribosomal protein L16 
TH1 drosophib homolog 
ONA segment on chromosome X (unique) 99212.4 
Sjogren syndrome antigen A2 (60kD. ribon 3.7 
hypothetical protein MGC955 14.3 

145 



11.3 
2.3 
3.5 
14.7 
7.8 
10 
9.2 
2 

12.5 
28.3 
6.1 
8.6 
6.1 
7.1 
14 
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132731 
132744 
132773 
132798 
5 132310 
132833 
132842 
132851 
132891 

10 132941 
132972 
132980 
132994 
133016 

15' 133177 
133208 
133254 
133266 
133268 

20 133285 
133390 
133391 
133540 
133594 

25 133621 
133720 
133760 
133784 
133791 

30 133797 
133822 
133850 
133865 
133881 

35 133924 
133959 
133989 
133997 
134234 

40 134348 
134376 
134379 
134405 
134421 

45 134480 
134516 
. 134529 
134751 
134790 

50 134806 
134850 
134859 
134971 
135181 

55 135207 
135245 
135257 
135307 
135321 

60 135354 
135400 
302276 
317781 
321114 

65 322555 
420802 
424001 



A1189075 Hs.301872 
AA010233 Hs.55921 
AA459713 Hs.295901 
AI026701 Hs.5716 
AB007944 Hs.5737 
U78525 Hs.57783 
NM 016154HS.279771 
UQ9716 Hs.287912 
BE267143 Hs.59271 
AI817165 Hs.6120 
AA034365 Hs.288924 
AA040696 Hs.62016 
AA1 12748 Hs.279905 
AI439688 Hs.6289 



X97795 
A1801777 
AI567421 
AI160873 



Hs.66718 
Hs.6774 
Hs.273330 
Hs.69233 



AW956781 Hs.293937 
M76477 Hs.289082 
AI950382 Hs.72660 
AW103364 Hs.727 
AL037159 Hs.74619 
AW160781 Hs.172589 
NM_004893Hs.75258 
L27841 Hs.75737 
BE271766 Hs.181357 
BE622743 Hs.301064 
M34338 Hs.76244 
AL133921 Hs.76272 
D50525 Hs.699 
W29092 Hs.7678 
AB011155 Hs.170290 
U30872 Hs.77204 
086326 Hs.325948 
X81789 Hs.77897 
AL040328 Hs.78202 
AJ824113 Hs.78281 
8E300078 Hs.80449 
AW291946 Hs.82065 
X06560 Hs.82396 
AW362124 Hs.323193 
AW067903 Hs.82772 
AU077196 Hs.82985 
NM_005000Hs.83916 
AK001571 Hs.273357 
AW411479 Hs.848 
AW630803 Hs.89497 
BE002798 Hs.287850 
AD001528 Hs.89718 
AI701162 Hs.90207 
026488 Hs.90315 
AIQ97346 Hs.286049 
BE250865 Hs.279529 
N26427 Hs.9634 
A1028767 Hs.262603 
AW291023 Hs.97255 
AI743770 Hs.98368 
AI652069 Hs.98614 
AA456454 Hs.183418 
X78592 Hs.99915 
AW057736 Hs.323910 
NM_007057Hs.42650 
AA902256 Hs.78979 
BE041451 Hs.177507 
U22376 Hs.1334 
W67883 Hs.137476 



hypothetical protein MGC4840 5.9 

glutamyl-profyl-tRNA synthetase 6.4 

KIAA0493 protein 14.6 

KIAA0310 gene product 2.5 

KIM0475 gene product 4.2 

eukaryotic translation Initiation factor 6.1 
Homo sapiens done PP1596 unknown mRNA7.1 

lectin, mannose-binding, 1 6.1 

U2(RNU2) small nuclear RNA auxiliary fac 2.7 

hypothetical protein RJ13222 Z\ 

Homo sapiens cDNA FU1 1392 fis, done HE 3.5 

ESTs 1.3 

doneHQ0310PRO0310p1 17.1 

hypothetical protein RJ20886 4.4 

RAD54 (S.cerewsiaeHike 4.4 

ESTs 5.5 
Homo sapiens, clone IMAGE:3544662, mRNA, 1 .3 

zinc finger protein 16.1 
ESTs. Weakly similar to FXD2_HUMAN FORKH 12.2 

GM2 gangfioside activator protein 1 0.4 

phosphatidytserine receptor 5.7 

inhibin, beta A (activin A, adivin AB a 25.5 

proteasome (prosome, macropain) 26S subu 1.7 

nuclear phosphoprotein similar to S. cer 2.6 

H2A histone family, member Y 13.5 

pericentriolar material 1 6.7 

laminin receptor 1 (67kD, ribosomal prot 5.4 

arfaptin 1 12.1 

spermidine synthase 9,7 

retinoblastoma-binding protein 2 1 .3 

peptidyiprotyi isomerase B (cydophifin 9.7 

cellular retinoic acid-binding protein 1 4.2 

discs, targe (Drosophila) homolog 5 5 

centromere protein F (350/400kD, mitosin 9. 1 

vesicle docking protein p1 15 1.8 

splicing factor 3a, subunit 3, 60kO 10.4 

SWi/SNF related, matrix associated, actl 2.6 

regulator of G-protein signalling 12 13 



Homo sapiens, clone IMAGE:3535294, mRNA, 1 0.3 
interteukin 6 signal transducer (gp130, 
2 l ,5'-oIigoadenyJate synthetase 1 (40-46 
hypothetical protein MGC3222 
collagen, type XI, alpha 1 
collagen, type V, alpha 2 
Empirically selected from AFFX single or 
hypofoetical protein FU10709 
FKSOB-binding protein 4 (59kD) 
laminBI 

integral membrane protein 1 
spermine synthase 
hypothetical protein MGC11138 
K1AA0007 protein 



6.7 
5.5 
5.8 
72.9 
6.7 
6.2 
1.4 
18 
6.1 
1.2 
2.6 
9.1' 
13.3 
2 

14.9 

ESTs, Highly similar to C10_HUMAN PUTAT1 1.7 



px19-iike protein 



ESTs 12.2 

ESTs, Weakly similar to A46010 X-finked 7.6 

ESTs, Weakly similar to KIAA0822 protein 5.8 

ribosome binding protein 1 {dog 180kD ho 1 2.3 

cefl division cycle 2-Cke 1 {PfTSLRE pr 5.7 

androgen receptor (dihydrotestosterone r 13.9 

HER2 receptor tyrosine kinase {c 5 J 

2W10lnterador 2.8 

Golgl apparatus protein 1 5.5 

hypothetical protein 2.9 

v-myb avian myeloblastosis viral oncogen 2.3 

paternally expressed 10 (PEG10; WAA105 7 
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425182 AF041259.HS.155040 

446999 AA151520 Hs.334822 

450701 H39960 Hs.288467 

452461 N78223 Hs.108106 

453157 AF077036 Hs.31989 



zinc finger protein 217 2,3 
hypothetical protein MGC4485 7.5 
Homo sapiens cDNA FU12280 fis, done MA 5.6 
transcription factor 4.7 
DKFZP566G1722 protein 12,1 
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TABLE 5A 

Table 5A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
5 designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

1 0 Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



15 Pkey CAT number Accessions 



35 



40 

122618 
125115 
120809 
45 129680 



50 

101045 
110501 
121558 
55 121911 



30686.-15 
656394J 
19346.14 
44573.2 



123615 
124385 
110856 
20 120472 



25 



30 129019 44573.2 



AA609170 
AI267847 N27351 

AA992380N33063 N21418 H79958 R21911 H79957 

AI950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 A1251473 H59397 AW971573 R97278 W01059 AW967671 
AA9Q8598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 A1950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 
AW150329 AI653832 A1762688 AA988777 AA488892 A1356394 AW103813 A1539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AWB56538 AA180009 AA337499 AW961 101 
AA251669 AA251874 A1819225AW205862 AI683338 A1858509AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

AI950087 N70208 R9704O N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AJ820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 M282915 AW102898 AI872193 AI763273 AW173586 
AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612B81 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 
AA251669 AA251874 AI819225 AW205862 AI683338 AJ858509 AW276905 A1633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 A1022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

120695 9683 3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 
AI796100 AI935603 AW052210 AA970201 A1633384 AA425910 AI017004 AI241295 AA402816 AA291468 
AA453641 AA454061 
genbankJT97341 T97341 
genban!eM346495 AA346495 

23162 1 U03749 NM.001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 A1560122 
AI858999 D55958 A1684005 D53170 M854091 AI025609 D53119 054729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 AI969102 AA4 05741 AI091983AA788784AA706586AA854361 AW470949 AA843095AA772028A1148432 
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 A1584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W3921 2 W15214 
AA894441 A1803081 AJ167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 
AA976699 AI687001 AI621107 AI865540 AA772107 C06286 AA319661 AA405992 

entre*J05614J05614 

9^1^^55748 H55748 

genbankuAA412497 AA412497 

genbankJ\A427950 AA427950 



305217J 
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TABLE 6: Figure 6 from BRCA 001 US 
Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnfcjenelD: Unigene number 

1 0 Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD UnigeneTitle 



R1 



100147 D13666 Hs.136348 osteoblast specific factor 2 {fascfclln 7.5 

100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 

101806 AA586894 Hs.1 12408 S100 calcium-binding protein A7(psorias 8.9 

102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9 

20 103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), 2.6 

105746 AW151952 Hs.46679 hypothetical protein FU20739 1.5 

106373 AW503607 Hs.21907 histone acetyltransferase 1.8 
1 10240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 

25 119260 AK001724 Hs.102950 coat protein gamma-cop 3.2 

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 

120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120297 AA191384 Hs.104072 ESTs, Weakly simitar to Z195.HUMAN ZINC 15.2 

120624 AW407987 Hs.173518 M-phase phosphoprotein homotog 52 
30 120695 AA976503 gb:oq30a04.s1 NCLCGAPJ5C4 Homo sapiens 46.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1 848 6.8 

121508 M402515 Hs.97887 ESTs 28 

122607 AA453518 Hs.98023 ESTs 61.5 

122616 AA453638 Hs.161873 ESTs 107.3 

35 122618 M453641 gb:zx48e06.s1 SoaresJestisJJHT Homo sap 31.1 

122829 AW204530 Hs.99500 ESTs 81.8 

12283B AA460584 Hs.334386 ESTs 75.3 

123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6 

124385 AI267847 gb:aq49a10.x1 Stanley Fronta! NB pool 2 57.1 

40 124860 R65763 Hs.101477 EST 23.9 

124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN HI! 22.8 

125051 T79956 Hs.100588 EST 135.3 

128781 N71826 Hs.105465 small nuclear ribonucteoprotein polypept 53,9 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 

45 129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6 

129689 AW748482 Hs.77873 B7homolog3 2.6 

130503 BE208491 Hs.295112 K1AA06 18 gene product 16.1 

130511 L32137 Hs.1584 cartilage otigorneric matrix protein {pse 6.1 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4 

50 131643 AW410601 Hs.30026 HSPC1 82 protein 2.9 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (y 2.7 

132180 NM 004460HS.418 fibroblast activation protein, alpha 14.7 

132370 AW572805 Hs.46645 ESTs 28.3 

132994 AA112748 Hs.279905 clone HQ0310PRO0310p1 17.1 

55 133016 A1439688 Hs.6289 hypothetical protein FU20886 4.4 

133266 AI160873 Hs.69233 zinc finger protein 16.1 

1 33391 AW103364 Hs.727 inhibin, beta A (adivin A, actlvin AB a 25.5 
134169 AI690916 Hs. 1781 37 transducer of ERBB2, 1 1.2 
134219 NNL000402Hs,80205 glucose-6-phosphate dehydrogenase 1.9 
60 134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9 
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8 
134975 R50333 Hs.92186 Leman coHed-coll protein 2.6 
135181 BE250865 Hs.279529 px19-like protein 14.9 
322556 BE041451 Hs.177507 hypothetical protein 2.9 
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TABLE 6A 



5 

10 
15 
20 



Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number. Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 
124385 656394 1 AJ267847 N27351 

120695 9683 3~ AA976503 AI917802AA953664 AA404613AA428771 BE280542AW1 94691 A1927301 AI740458 AI796100 AI935603 

AW052210 M970201 AI633384 AA425910 A1017004 A1241295 AA402816 AA291468 
122618 305217J M453641 AA454061 
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TABLE 7: Figure 7 from BRCA 001-1 US 



5 Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other. 



10 Ptey: 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ExAccn: 
UnigenelD: 
Unigene Title: 
R1: 

ORF struct info: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 

Unigene gene title 

Ratio of tumor to normal breast tissue 
Structural c 



gene 



Pkey ExAccn UnigenelD UnigeneTitle 



100113 
100114 
100131 
100146 
100147 
100154 
100163 
100220 
100265 
100271 
100275 
100323 
100335 
100364 
100372 
100393 
100400 
100418 
100482 
100518 
100666 
100667 
100668 
100678 
100783 
100892 
100945 
100969 
100988 



101031 
101045 
101077 
101093 
101186 
101216 
101228 
101247 
101249 
101332 
101352 
101396 
101445 
101470 
101478 



NM001269 
X02308 
D12485 
BE185499 
D13666 
H60720 
W44671 
AW015534 
D38521 
BE160081 
BE242802 
D50920 
AW247529 
NNL004341 
NWL014791 
D84145 
AW954324 
D86978 
M65028 
NMJ04415 
L05424 
L05424 
L05424 
AW502935 
AF078847 
BE245294 
AF002225 
AA157634 
AK000405 
H38765 
J05070 
J05614 



106419 

AAO20956 

AA284166 

AA333387 

AA132666 

L18964 

J04088 

AJ494299 

BE267931 

M21259 

NIVL000546 

NMJ0O289O 



Hs.84746 

Hs.82962 

Hs.11951 

Hs.2471 
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107071 AW385224 
107113 AK000733 
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107146 AK001455 
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107263 D60341 
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107299 BE277457 
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107354 NNL.0Q6299 
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107529 BE515065 
107554 M001386 
107681 BE379594 
107772 AA018587 
107859 AW732573 
107901 L42612 
107922 8E153855 
107974 AW956103 
108040 AL121031 
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108496 M083069 
108607 BE300380 
108621 AA101809 
108634 AW022410 
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ESTs, Weakly similar to A54849 collagen 
cydin B2 

transmembrane 7 superfamily member 2 
KIAA1483 protein 

Homo sapiens cDNA: RJ21487 fis, clone C 

zinc finger protein 278 

Homo sapiens cDNA: RJ23038 fis, done L 

ESTs 

Homo sapiens cDNA FU10071 fis, clone HE 
Mn1 

tyrosylprotein sulfotransferase 2 

Homo sapiens mRNA; cDNA DKFZp564O0122 (f 

ESTs, Moderately similar to S65657 alpha 

hypothetical protein FU12549 

TIA1 cytotoxic granule-associated RNA-bi 

ESTs 
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hypothetical protein 
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transcription factor BMAL2 
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ESTs 
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transferase of outer mitochondrial membr 

ESTs, Moderately similar to ALU7_HUMAN A 

hypothetical protein MGC4606 

Homo sapiens mRNA; cDNA DKFZp586l0324 (f 

zinc finger protein 193 

TATA element modulatory factor 1 

kinesin family member 4A 

nucleolar protein (KKE/D repeat) 

ESTs 

ESTs, Moderately similar to AIU7JHUMAN A 
ESTs, Weakly similar to ALU1 JiUMAN ALU S 
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ESTs 
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108647 BE546947 


Hs.44276 


homeo box C10 


9.8 


other 


108695 AB029000 


Hs.70823 


WAA1077 protein 


7.3 


other 


108740 AIQ89575 


Hs.9071 


progesterone membrane binding protein 


2.8 


? 


108828 AK001693 


Hs.273344 


DKFZP564O0463 protein 


1.9 


other 


108859 AL121500 


Hs.178904 


ESTs 


1.6 


TM 


108872 H06720 


Hs.111680 


endosulfine alpha 


2.2 


other 


108891 AI801235 


Hs.48480 


ESTs 


5.4 


other 


108894 AK001431 


Hs.5105 


hypothetical protein FU10569 


4,1 


TM 


108955 AA149754 


Hs.195155 


Homo sapiens amino acid transport system 


5.7 


? 


108982 AA151708 


Hs.171980 


homeo box (expressed In ES cells) 1 


1.7 


other 


108987 AA152178 


Hs.23467 


hypothetical protein FU10633 


6,3 


other 


109002 AB028987 


Hs.72134 


WAA1064 protein 


17 


other 


109011 AA155542 


Hs.72127 


ESTs 


1.5 


other 


109026 AA1 57811 




gb:zo35d07.s1 Stratagene colon (937204) 


5.4 


other 


109068 AA164293 


Hs.72545 


ESTs 


3 


other 


109101 AW608930 


Hs.52184 


hypothetical protein FU20618 


1.6 


SS, 


109112 AW419196 


Hs.257924 


hypothetical protein FU13782 


3.3 


TM 


109124 AK000684 


Hs.183887 


hypothetical protein FU22104 


1.7 


other 


109139 AJ132592 


Hs.59757 


zinc finger protein 281 


2.7 


other 


109166 AA219691 


Hs.73625 


RAB6 interacting, kinesin-iike (rabkirtes 


3 


TM 


109198 BE566742 


Hs.58169 


highly expressed in cancer, rich in leuc 


2.1 


other 


109213 NMJJ16603 


Hs.82035 


potential nuclear protein C50RF5; GAP-li 


5.4 


other 


109220 AW958181 


Hs.189998 


ESTs 


5.8 


other 


109233 AU077281 


Hs.170285 


nucleoporin 214kD (CAIN) 


5.3 


other 


109270 N99673 


Hs.3585 


ESTs, Weakly similar to AF126743 1 DNAJ 


1.4 


other 


109273 AA375752 


Hs.82719 


Homo sapiens mRNA; cONA DKFZp586F1822 (f 


3 


other 


109313 AF153201 


Hs.86276 


C2H2 (KruppeMype) zinc finger protein 


1.3 


other 


109341 AA213506 


Hs.1 15099 


EST 


3 


? 


109391 AL096858 


Hs.184245 


KIAA0929 protein Msx2 interacting nudea 


1.5 


other 


109420 H83603 


Hs.40408 


homeo box C9 


2.2 


SS, 


109426 N30531 


Hs.42215 


protein phosphatase 1, regulatory subuni 


3.1 


TM 


109429 AI160029 


Hs.61438 


ESTs 


2 


? 


109445 AA232103 


Hs.189915 


ESTs 


1.8 


other 


109450 AB032969 


Hs.173042 


KIAA1143 protein 


3.8 


other 


109468 NMJJ15310 


Hs.6763 


K1AA0942 protein 


3.3 


other 


109478 AW074143 


Hs.87134 


ESTs 


2 


TM 


109570 L40027 


Hs.1 18890 


glycogen synthase kinase 3 alpha 


2.1 


other 


109662 F02614 


Hs.27319 


ESTs 


1.4 


other 


109825 R71264 


Hs.16798 


ESTs 


1.3 


other 


110039 H11938 


Hs.21907 


hlstone acetyttransferase 


2 


other 


110056 AA503041 


Hs.279009 


matrix Gla protein 


2.5 


other 


110085 AA603840 


Hs.29956 


KIAA0460 protein 


1.7 


other 


110110 T07353 


Hs.7948 


ESTs 


2.9 


other 


110129 R51853 


Hs.226429 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


1.7 


SS, * 


110154 NMJM4521 


Hs.17667 


SH3-domain binding protein 4 


4.3 


other 


110240 AI668594 


Hs.176588 


ESTs. Weakly similar to CP4Y_HUMAN CYTOC 


4.3 


? 


110242 N41744 


Hs.19978 


CGI-30 protein 


1.3 - 


other 


110259 H28428 


Hs.32406 


ESTs, Weakly similar to 138022 hypotheti 


2.2 


other 


110312 BE256986 


Hs.1 1896 


hypothetical protein FU12089 


2.1 


other 


110501 H55748 




gb:yq94a01.s1 Scares fetal liver spleen 


6,1 


? 


110504 H55915 


Hs.210859 


hypothetical protein FU11016 


6.1 


TM 


110525 H57330 


Hs.37430 


EST 


6.4 


other 


110568 AK001160 


Hs.5999 


hypothetical protein FU10298 


« 


? 


110699 T97586 


Hs.18090 


ESTs 


1.8 


other 


110705 AB007902 


Hs.32166 


KIAA0442 protein 


1.6 


TM 


110742 AW190338 


Hs^8029 


hypothetical protein MGC11256 


7.8 


other 


110761 AL138077 


Hs.16157 


hypothetical protein FU12707 


2.5 


other 


110762 BE044245 


Hs.30011 


hypothetical protein MGC2963 


9.3 


? 


110765 AK000322 


Hs.18457 


hypothetical protein FU20315 


'5.5 


SS, 


110769 BE000831 


Hs.23837 


Homo sapiens cDNA FU1 1812 fis, clone HE 


2.1 


TM 


110799 A1089660 


Hs.323401 


dpy-30-Uke protein 


1.5 


TM 


110805 T25829 


Hs.24048 


FK506 binding protein precursor 


6.7 


TM 


110813 AA767373 


Hs.35669 


ESTs, Moderately similar to ALU1_HUMAN A 


5.7 


other 


110820 R33261 


Hs.6614 


ESTs, Weakly similar to A43932 mucin 2 p 


3,4 


other 


110840 N31598 


Hs.12727 


hypothetical protein FU21610 


1.7 


TM 


110844 AI740792 


Hs.167531 


memylcrotaioyl-Coenzyme A carboxylase 2 


1.7 


other 


110854 BE612992 


Hs.27931 


hypothetical protein FU10607 similar to 


4.7 


other 
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110856 AA992380 
110885 BE384447 
110897 AL117430 
110915 BE092285 
110918 H04360 
110958 NMJD05864 
110963 AK002180 
110981 AK001980 
110984 AW613287 
111125 N63823 
111132 AB037807 
111164 N46180 
111172 R67419 
111174 AL050166 
111179 AKO00136 
111184 AI815486 
111189 N67603 
111216 AW139408 
111221 AB037782 
111223 AA852773 
111239 N90956 
111285 M778711 
111299 AB033091 
111312 AI523913 
111318 T99755 
111337 AA837396 
111352 H58589 
111370 AI478658 
111384 N94606 
111389 AK000987 
111452 R02354 
111486 AI051194 
111543 W90638 
111585 R10720 
111627 R52656 
111870 AB037834 
111937 BE298665 
111944 AW083791 
111987 NM.015310 
112134 R41823 
112244 AB029000 
112388 R46071 
112456 NM 016248 
112464 AW007287 
112506 AI742756 
112513 R68425 
112752 AK001635 
112884 AK000004 
112923 T10258 
112936 AW970826 
112958 R61388 
112966 Z44718 
112978 AK000272 

112995 AA737033 

112996 BE276112 
113047 AI571940 
113049 AW965190 
113089 T40707 
113196 T57317 
113248 T63857 
113254 AK002180 
113277 AW971049 
113429 AA688021 
113499 A1467908 
113547 H59588 
113647 AA813887 
113702 T973Q7 
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gb:ot37g06.s1 SoaresJestisJJHT Homo sap 
hypothetical protein MGC13186 
DKFZP434D156 protein 
hypothetical protein aJ13187 
ESTs, Moderately similar to reduced expr 
signal transduction protein (SH3 contain 
OKFZP5640123 protein 
ADP-ribosyttransferase (NAD+; poly(ADP-r 
UDP-N^cety^ipha-D-galactosamine:polyp 
ESTs, Moderately similar to Z195_HUMAN Z 



Homo sapiens cDNA FU13289 fis, clone OV 
Homo sapiens cDNA RJ12900 fis, clone NT 
Homo sapiens mRNA; cDNA DKFZp586D1122 (f 
asporin (LRR class 1) 

Homo sapiens cDNA FLJ20738 fis, clone HE 
ESTs, WeaWy similar to S65824 reverse t 
ESTs 

KIAA1361 protein 

K1AA1866 protein 

hypoftettcal protein FU22087 

eukaryotic translation initiation factor 

KIAA1265 protein 

ESTs 

ESTs 

US1-interading protein NUDE1, rat homo 

Homo sapiens cDNA FU11027 fis, clone PL 

brefeldin A-lnhlbited guanine nudeotide 

HSCARG protein 

oxidation resistance 1 

ESTs 

EST 

ESTs, Moderately similar to ZRF1JWMAN Z 

EST 

ESTs 

Homo sapiens mRNA for KIAA1413 protein, 
Homo sapiens mRNA; cDNA OKFZp564D016 (tr 



K1AA0942 protein 
ESTs; calsynlenin-2 
KIAA1077 protein 

Homo sapiens, done 1MAGE:3638994, mRNA, 
A kinase (PRKA) anchor protein 11 
Homo sapiens cDNA: FLJ21086 fis, clone C 
ESTs 

hypothetical protein FU10648 

hypothetical protein FU10773 

Homo sapiens mRNA for FU00004 protein. 

EST 

KIAA1557 protein 
ESTs 

glucocorticoid receptor DNA binding fact 

hypothetical protein FU20265 

ESTs, Moderately similar to 2115357ATYK 

zinc finger protein 259 

ESTs 

Homo sapiens mRNA for KIAA1729 protein, 
ESTs 

gb:yb51a03.s1 Stratagene fetal spleen (9 

gb:yc16e01.s1 Stratagene lung (937210) H 

0KFZP564O123 protein 

protein (peptidyl-proiyi ds/trans isome 
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Homo sapiens cDNA FU12187 fis, done MA 
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113759 AW499665 


Hs.9456 


SWl/SNF related, matrix associated, actl 


1.2 


other 


113777 BE266947 


Hs.10590 


zinc finger protein 313 


13.4 


other 


113783 AL35958B 


Hs.7041 


hypothetical protein DKFZp762B226 


1.7 


other 


113791 AI269096 


Hs.135578 


chitobiase, di-N-acetyt- 


1.3 


other 


113808 W44735 


Hs.9286 


Homo sapiens cONA: FU21278 fe, clone C 


3.3 


other 


113811 BE207480 


Hs.6994 


Homo sapiens cONA: FU22044 6s, done H 


3.1 


other 


113817 H13325 


Hs.332795 


hypothetical protein DKFZp761017121 


3.2 


other 


113826 AW378212 


Hs.24809 


hypothetical protein RJ10826 


2.3 


? 


113834 T26483 


Hs.6059 


EGF-containing Gbufin-Gke extraceOuh 


11-3 


TM 


113868 W57902 


Hs.90744 


proteasome (prosome, macropaJn) 26S subu 


Z7 


other 


113870 AL079314 


HsJ6537 


hypothetical protein, similar to (U06944 


6.1 


other 


113885 AW959486 


Hs.21732 


ESTs 


6.6 


other 


113923 AW953484 


Hs.3849 


hypothetical protein FLJ22041 simflarto 


1.9 


? 


113989 W87544 


Hs.268828 


ESTs 


1.2 


other 


114022 AI539519 


Hs.120969 


Homo sapiens cDNA FU1 1562 fis, clone HE 


5.4 


other 


114030 AI825386 


Hs.164478 


hypothetical protein FU21939 similar to 


9.4 


other 


114060 AB029551 


Hs.7910 


R1NG1 and YY1 binding protein 


1.8 


other' 


114196 AF017445 


Hs.150926 


fucose-1 -phosphate guanyiyltransferase 


1.5 


other 


114226 AB028968 


Hs.7989 


KIAA1045 protein 


1.8 


other 


114253 BE149866 


Hs.14831 


Homo sapiens, Similar to zinc finger pro 


2.3 


other 


114262 AL117518 


Hs.3686 


KIAA0978 protein 


1.4 


TM 


114275 AW515443.comp 


Hs.306117 


KIAA0306 protein 15.8 


114292 AI815395 


Hs.184641 


fatty acid desaturase 2 
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TM 


114309 AA332453 


Hs.20824 


CGW5 protein 


2.4 


other 


114392 AA249590 


Hs. 100748 


ESTs. Weakly similar to A28996 proEne-r 


1.9 


other 


114407 BE539976 


Hs.103305 


Homo sapiens mRNA; cONA OKFZp434B0425 (f 


1.3 


TM 


114455 H37908 


Hs.271616 


ESTs. Weakly similar to ALU8.HUMAN ALU S 


5.6 


other 


114463 AL120247 


Hs.40109 


KIAA0872 protein 


5.3 


TM 


114464 AI091713 


Hs.106597 


Homo sapiens, Similar to RIKEN cONA 1110 


1.3 


other 


114471 AA028074 


Hs.104613 


RP42 homolog 
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? 


114480 BE066778 


Hs.151678 


UDP-N-acetyt-a1pha-0-gaiactosamine;polyp 


13.4 


other 


114671 AA766268 


Hs,266273 


hypothetical protein FU13346 
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other 


114698 AA476966 


Hs.110857 


polymerase (RNA) ill (DNA directed) poly 


3.6 


other 


114730 AJ373544 


Hs.331328 


intermediate filament protein syncoiiin 


3.9 


other 


114767 AI859865 


Hs.154443 


minlchromosome maintenance deficient (S. 


1.7 


other 


114774 AV656017 


Hs.184325 


CGt-76 protein 


3,2 


other 


114798 AA159181 


Hs.54900 


serologically defined colon cancer an tig 


3.6 


other 


114860 AL157545 


Hs.42179 


bromodomain and PHD finger containing, 3 


4.4 


other 


114895 AA236177 


Hs.76591 


K1AA0887 protein 
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other 
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Hs.5324 


hypothetical protein 
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other 
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ESTs 
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ESTs 
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other 
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Hs.72472 
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other 
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toll-like receptor 9 
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other 
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Hs.41271 


Homo sapiens mRNA fuB length insert cDN 
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other 
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Hs.5324 


hypothetical protein 


1.5 


other 


115206 AW183695 


Hs.186572 


ESTs 


2.5 


other 


115221 AW365434 


Hs.79741 


hypoflietical protein FU101 16 


15 


other 


115239 BE251328 


Hs.73291 


hypothetical protein FU10881 


1.3 


TM 


115242 AJ358236 


Hs.283732 


ESTs, Moderately similar to ALU1_HUMAN A 


1.4 


other 


115278 AK002163 


Hs.301724 


hypothetical protefri FU11301 


1.5 


other 


115285 AW972872 


Hs.293736 


ESTs 


2.4 


other 


115291 BE545072 


Hs.122579 


hypothetical protein FU10461 


6.3 


SS, 


115400 AI215069 


Hs.89113 


ESTs 


6.7 


? 


115468 AA314349 


Hs.48499 


tumor antigen SLP-8p 


7.5 


? 


115471 AK001376 


Hs.59346 


hypothetical protein FU10514 


1.4 


TM 


115479 AW3O1608 


Hs.278188 


ESTs, Moderately similar to I54374 gene 


4.1 


TM 


115496 AW247593 


Hs.71819 


eukaryotic translation initiation factor 


16.3 


other 


115500 Y14443 


Hs.88219 


zinc finger protein 200 


5 


other 


115553 AJ275986 


Hs.71414 


transcription factor (SMIF gene) 


2.5 


other 


115581 A1540842 


Hs.61082 


ESTs 


6.2 


other 


115587 BE081342 


Hs.283037 


HSPC039 protein 


2.9 


other 


115590 AA399477 


Hs.67896 


7-60 protein 


5.3 


TM 


115646 N36110 


Hs.305971 


solute carrier family 2 {facilitated gfu 


4.8 


? 


115652 BE093589 


Hs.38178 


hypothetical protein FU23468 


10.6 


other 



other 
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115655 AL048269 Hs.288544 Homo sapiens, clone MGC: 16063, mRN A, com 

115663 AI138785 Hs,4O507 ESTs 

115676 AA953006 Hs.88143 ESTs 

115690 M625132 Hs.44159 hypothetical protein FLJ21615 

5 115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass G-type rece 

115715 BE395161 Hs.1390 proteasome (prosome. macropaln) subunlt, 

115734 AI950339 Hs.40782 ESTs 

115811 NMJJ15434 Hs.48604 OKFZP434B 168 protein 

115823 AI732742 Hs.87440 ESTs 

10 115837 AI675217 Hs.42761 ESTs 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 

115866 AW062629 Hs.52081 KIAA0867 protein 

115875 N55669 Hs.333823 mitochondriaJ ribosomal protein L1 3 

115941 A1867451 Hs.46679 hypothetical protein FLJ20739 

15 115968 AB037753 Hs.62767 K1AA1332 protein 

116003 BE275469 Hs.66493 Down syndrome critica! region gene 5 

116011 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDN 

116108 AA770688 Hs2Q777 H2A histone family, member i 

116134 BE243834 Hs.50441 CG1-04 protein 

20 116189 N35719 Hs.44749 ESTs, Moderately similar to T00358 hypot 

116195 AW821113 HsJ2402 ESTs 

116238 AV660717 Hs.47144 DKFZP586N0819 protein 

116246 AF265555 Hs.250646 baculovlral tAP repeat-containing 6 

116262 A1936442 Hs.59838 hypothetical protein FU10808 

25 116298 AI955411 Hs.94109 Homo sapiens cONA FU13634 fis, clone PL 

116318 AF097645 Hs.58570 deleted in cancer 1; RNA helicase HDB/Dl 

116325 AI472106 Hs.49303 HomosapienscONAFUIieeSfis.doneHE 

116336 AL133033 Hs.4084 KIAA1025 protein 

116339 AK000290 Hs.44033 dipeptidyl peptidase 8 

30 116350 AA497129 Hs.184771 nuclear factor l/C (CCAAT-binding transc 

116358 AI149586 Hs.38125 interferan-induced protein 75, 52kD 

116365 N50174 Hs.46765 ESTs 

116368 N90456 Hs.71109 KJAA1229 protein 

116417 AW499664 Hs.12484 Human done 23826 mRNA sequence 

35 116436 AA161411 Hs.58668 chromosome 21 open reading frame 57 

116462 AF218313 Hs.236828 putative helicase RUVBL 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 

116575 AA312572 Hs.6241 phosphoinositide-3-kinase, regulatory su 

116637 AK001043 Hs.92033 Inlegrirv-finked kinase-associated serine 

40 116640 X89984 Hs.211563 B-cell CLUrymphoma 7A 

116700 A1800202 Hs.317589 hypothetical protein MGC1 0765 

116705 AW074819 Hs.12313 hypothetical protein FU 14566 

116732 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti 

116926 H73608 Hs.290830 ESTs 

45 117034 U72209 Hs.180324 YY1-associated factor 2 

117132 AI393666 Hs.42315 p10-binding protein 

117247 N21032 gb:yx46f06.s1 Soares melanocyte 2NbHM Ho 

117276 N71183 Hs.121806 Homo sapiens cDNA FU11971 fis, done HE 

117284 AK001701 Hs.183779 Homo sapiens cDNA FU10590 fis, done NT 

50 117367 AI041793 Hs.42502 ESTs 

117368 A1878942 Hs.90336 ATPase, H+ transporting, lysosomal (vacu 

117382 AF150275 Hs.40173 ESTs 

117412 N32536 Hs.42645 solute carrier family 16 (rnonocarboxylic 

117557 AF123050 Hs.44532 dlubiqultln 
55 117588 N34895 Hs.44648 ESTs 

117745 BE294925 Hs.46680 - CGM2 protein 

117754 AA121673 Hs.59757 zinc finger protein 281 

117879 N54706 Hs.303025 chromosome 11 open reading frame 24 

117904 BE540675 Hs.332938 hypothetical protein MGC5370 
60 117911 AL137379 Hs.47125 hypothetical protein FU1 39 12 

117933 Y10518 Hs.116470 hypothetical protein FU20048 

117983 AL110246 Hs.47367 KIAA1785 protein 

118078 N54321 Hs.47790 EST 

118301 AA453902 Hs.293264 ESTs 
65 118429 AA243332 Hs.74649 cytochrome c oxidase subunit Vic 

118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 

118488 AJ277275 Hs.50102 rapa-2 (rapa gene) 
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118509 N22617 Hs.43228 Homo sapiens cDNAOJ 11 83 5 fis, done HE 

118528 AI949952 Hs.49397 ESTs 

118656 AW58020 Hs.293287 ESTs 

118670 AA332845 Hs.152618 ESTs, Moderately similar to ZN91_HUMAN Z 

5 118698 AB033113 Hs.50187 K1AA1287 protein 

118737 AA199686 gb:zq75g09.rl Stratagene hNT neuron {937 

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 

118984 AI668709 Hs.240722 ESTs, Moderately similar to ALIHLHUMAN A 

118986 AF148713 Hs.125830 bladder cancer overexpressed protein 

10 119206 W24781 Hs.293798 WAA1710 protein 

119235 AW453069 Hs.3657 activity-dependent neuroprotective prate 

119265 BE539706 Hs.285363 ESTs 

119279 N57568 Hs.48028 EST 

119298 NNL001241 Hs.155478 . cycBnT2 

15 119338 AM17240 Hs.320836 ESTs, WeaWy similar to A47582B-cellgr 

119403 AL117554 Hs.119908 nucleolar protein NOP5/NOP58 

119478 AI624342 Hs.170042 ESTs 

119486 AI796730 Hs.55513 ESTs 

119513 W37933 Empirically selected from AFFX single pr 

20 119601 AK000155 Hs.91684 Homo sapiens mRNA; cDNA DKFZp6671 103 (fr 

119602 AW675298 Hs.233694 hypothetical protein FLJ1 1350 

119676 AA243837 Hs.57787 ESTs 

119682 W61019 Hs.57811 ESTs 

119774 AB032977 Hs.6298 KIAA1151 protein 

25 119780 NWL016625 Hs.191381 hypothetical protein 

119789 BE393948 Hs.50915 kailikreln 5 (KLK5; KLK-L2; stratum com 

119805 AJ223810 Hs.43213 ESTs, Weakly similar to iEFSJHUMAN TRANS 

119818 AA130970 Hs.58382 hypothetical protein FU 11 101 

119863 AA081218 Hs.58608 Homo sapiens cDNA FU14206 fis. done NT 

30 1 19905 AW449064 Hs.1 19571 collagen, type HI, alpha 1 {Ehlers-Oanl 

119966 AA703129 Hs.58963 ESTs 

120132 W57554 Hs.125019 lymphoid nudear protein (LAF-4) mRNA 

120206 H26735 Hs.91668 Homo sapiens done PP1498 unknown mRNA 

120248 AI924294 Hs.1 73259 un characterized bone marrow protein BMD3 

35 120269 AW13194Q Hs.104030 ESTs 

120274 AA177051 gb:hc02aO2.s1 NCLCGAPJ>r3 Homo sapiens 

120280 AA190577 gb:zp52g02.s1 Stratagene HeLa cell s3 93 

120296 AW995911 Hs.299883 hypothetical protein FLJ23399 

120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 
40 120324 AA195517 Hs.191643 ESTs 

120325 AA195651 Hs.104106 ESTs 

120327 AK000292 Hs.278732 hypothetical protein FU20285 

120336 N85785 Hs.181165 eukaryotic translation elongation factor 

120342 AW450669 Hs.45068 hypothetical protein OKFZp434l1 43 

45 120345 AA210722 Hs.104158 ESTs 

120349 AW969481 Hs.55189 hypothefical protein 

120352 R06859 Hs.193172 ESTs, Weakly similar to I38022 hypothec - 

120356 AF000545 Hs.296433 putative purinergic receptor 

120371 AA219305 Hs.1Q4196 EST 

50 120382 AA228026 Hs.38774 ESTs 

120383 AL109963 Hs.123122 FSH primary response (LRPR1, rat) homob 

120386 AW969665 Hs.154848 hypoftietical protein DKFZp434D0127 

120388 AA232874 Hs.104245 ESTs 

120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7_HUMAN A 
55 120396 AA134006 Hs.79306 eukaryotic translation initiation (actor 

120404 AB023230 Hs.96427 KIAA1013 protein 

120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA OKFZp586F1 323 (f 

120423 AA236453 Hs.18978 Homo sapiens cDNA: FU22822 fis, done K 

120472 AI950087 gbwq05c02j(1 NCLCGAP^Kld12 Homo sapien 

60 120473 AA251973 Hsi69988 ESTs 

120484 AA253170 Hs.96473 EST 

120504 AA256837 gb:zr84d10.s1 Soares_NhHMPu_S1 Homosapl 

120509 BE047718 Hs.96545 ESTs 

120520 AA258601 Hs.161731 EST 

65 120535 BE350244 Hs.96547 ESTs 

120551 AA279160 Hs.UUQ7 Homo sapiens, done 1MAGE:3613029, mRNA, 

120570 AA280679 Hs.271445 ESTs, WeaWy similar to ALU1.HUMAN ALUS 
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BE244830 Hs.284228 2NF135-Bke protein 

AW372799 Hs.1 25790 leutine-rich repeat-containing 2 

AA282074 Hs.237323 N-acetylglucosamine-phosphate mutase 

AW965339 Hs.1 11471 ESTs 

AW407987 Hs.173518 M-phase phosphoproteln homolog 

AA286942 gb:zs56fD5.s1 NCLCGAPJ3CB1 Homo sapiens 

AA287095 Hs.140309 Homo sapiens, clone 1MAGE:3677194, mRNA, 

AW063659 Hs.191649 ESTs 

AW969638 Hs.1 12318 6.2 kd protein 

BE536739 Hs.109909 ESTs 

AA976503 gb:oq30a04.s1 NCLCGAPJ3C4 Homo sapiens 

A1821539 Hs.97249 ESTs 

AW449855 Hs.96557 Homo sapiens cDNA FU12727 fis, done NT 

AA292747 Hs.97296 ESTs 

AI191410 Hs.96693 ESTs, Moderately similar to 2109260A B c 

A1608909 Hs.193985 ESTs 

AA346385 Hs.30OO2 SH3-containing protein SH3GUB2; K1AA1848 

AA346495 gb:EST52657 Fetal heart II Homo sapiens 

AA386260 Hs.104632 EST 

AA398155 Hs.97600 ESTs 

BE262951 Hs.99052 ESTs 

AI219896 Hs.97592 ESTs 

AA398360 Hs.97608 EST 

A1439713 Hs.165295 ESTs 

AA398721 Hs.186749 ESTs, Highly similar to I37550 mismatch 

AA363307 Hs.97032 ESTs 

AL121523 Hs.97774 ESTs 

AI002110 Hs.97169 ESTs, Weakly similar to dJ667H12.2.1 Jl 

AA403008 Hs.301927 c6.1A 

AW956981 Hs.97910 Homo sapiens cDNA FU13383 fis, clone PL 

AA406137 Hs.98019 EST 

AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-ceil gr 

AA4G6430 Hs.105362 Homo sapiens, done MGC:18257, mRNA, com 

AW971063 Hs.292882 ESTs 

H58306 Hs.15165 retinoic add Induced 14 

W07404 Hs. 144502 hypothetical protein FLJ22055 

AA442224 Hs.97900 ESTs 

AA494172 Hs.194417 ESTs 

AA402515 Hs.97887 ESTs 

AA416653 Hs.181510 ESTs 

AA412112 gb:zt69b02.s1 Soares_testis_NHT Homo sap 

AA412477 Hs.98142 EST 

AA412497 gb:zt95g12.s1 Soares_testis__NHT Homo sap 

AA411970 Hs.98096 EST 

AA416568 gb:zu05c10.s1 Soares_testis_NHT Homo sap 

AD001528 Hs,89718 spermine synthase 

AA626010 Hs.98247 ESTs 

AA416931 Hs.126065 ESTs j(1JB4(VBtt 

AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKF2p434B1023 {f 

AA418160 Hs.86043 Homo sapiens cDNA FU13558 fis, done PL 

AV660305 Hs.110286 ESTs 

U55184 Hs.154145 hypothetical protein FLJ 11 585 

AA419225 Hs.98269 Homo sapiens cDNA FLJ11953 fis, done HE 

A1949597 Hs.98325 ESTs 

AA421041 Hs.180744 ESTs 
AA398784 Hs.97514 ESTs 
BE53691 1 Hs.234545 hypothetical protein NUF2R 
AB033022 Hs.158654 WAA1 196 protein 
AA421773 Hs.161008 ESTs 
AA292579 Hs.125133 hypothetical protein FU22501 
AI810774 Hs.98376 ESTs 
AW340797 Hs.98434 ESTs 
AA328348 Hs.218289 ESTs 

AA425691 Hs.191606 ESTs, Highly similar to K1AA1048 protein 
AF027406 Hs.1 04865 serine/threonine kinase 23 
AA446628 Hs.2799 cartilage linking protein 1 
AW972668 Hs.293044 ESTs 
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ESTs 
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. other 


121911 AA427950 




gb.*zw50ff)2.s1 Soares_totaLfetus_Nb2HF8 
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TM 


121915 AA428179 


Hs,223405 


ESTs, Moderately similar to A46010 X-fin 
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other 


121935 AA428647 


Hs.98611 


EST 


2.3 


other 


121983 AA298760 


Hs.180191 


hypothetical protein FU14904 


3.4 


other 


121985 A1862570 


Hs.299214 


Homo sapiens, clone IMAGE:2822295, mRNA, 
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other 


121995 AA210863 


Hs.3532 


nemo-like kinase 


3.8 
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121999 AA430211 


H5.93668 


EST 
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other 


122009 AW292763 


Hs.160822 


Homo sapiens cDNA: FU20863 fis, clone A 
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other 


122013 AA431085 


Hs.98706 


ESTs 
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other 


122036 W92142 


Hs.271963 


ESTs, Weakly similar to ALU5.HUMAN ALU S 


13.1 


other 


122050 AI453076 


Hs.166109 


ELAV (embryonic lethal, abnormal vision, 


9.1 


other 


122060 M431738 


Hs.98750 


EST 


13.1 
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122114 AW161023 


Hs.104921 


ESTs 
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other 


122188 AA398838 




gbzt80d01.r1 Soares testis NHT Homo sap 


3.4 


other 


122204 AA435936 


Hs.98842 


EST 
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other 


122246 AA329550 


Hs.29417 


HCF-binding transcription factor Zhangfe 


5.2 


other 


122257 AA436819 


Hs.98899 


ESTs 
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other 


122302 AA441801 


Hs.104947 


ESTs 


5.8 


other 


122341 AW601969 


Hs.99010 


hypothetical protein FU22263 similar to 
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ofter 


122356 M443794 


Hs.98390 


ESTs 
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SS.TM 


122369 AA443985 


Hs.303222 


ESTs 


12.2 


? 


122371 AA868555 


Hs.178222 


ESTs 
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? 


122372 AA446008 


Hs.336677 


EST 


7.8 


? 


122378 AB032948 


Hs.21356 


hypothetical protein OKFZp762K2015 


2.5 


? 


122405 AA446572 


Hs.303223 


EST 
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TM 


122412 AA446869 


Hs.119316 


ESTs 


7.4 


other 


122415 AA446918 


Hs.99088 


EST 
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other 


122418 AA446966 


Hs.99090 


ESTs, Moderately similar to similar to K 


6.9 


? 


122440 AW505139 


Hs.9460 


Homo sapiens mRNA; cONA OKFZp547C244 (fr 
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other 


122446 AA447603 


Hs.99123 


EST 
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TM 


122448 AA447626 
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EST 
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other 


122458 AI266159 


Hs.104980 


ESTs 


15 


other 


122460 AW418788 


Hs.99148 


ESTs, Weakly similar to S43569 R01H10.6 
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other 


122464 AA448158 
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EST 
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other 


122490 AA448349 


Hs.238151 


EST 
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122492 AA448417 


Hs.104990 


ESTs 


5.5 


other 


122502 AA204969 


Hs.234863 


Homo sapiens cDNA RJ12082 fis, clone HE 


1.3 


other 


122510 AA449232 


Hs.99195 


ESTs 


\\2 


? 


122530 AW959741 


Hs.40368 


adaptor-related protein complex 1. sigma 


10.1 


other 


122547 AA779725 


Hs.164589 


ESTs 


2.5 


SS, 


122555 AA 194055 


Hs.293858 


ESTs 


1.9 


other 


122570 AA452578 


Hs.262907 


ESTs 


9.5 


other 


122572 AM52601 


Hs.992S7 


EST 


11 


? 


122586 AK001910 


Hs.99303 


Homo sapiens cDNA FU1 1048 fis. done PL 


3.4 


other 


122587 AB040893 


Hs.6968 


KIAA1460 protein 


2 


other 


122598 AI028173 


HS.99329 


ESTs 


1.7 


? 


122599 AL355841 


Hs.99330 


hypothetical protein FU23588 


4.4 


? 


122602 AA411925 


Hs.301960 


ESTs 


4.7 


other 


122607 AA453518 


Hs.98023 


ESTs 


61.5 


other 


122614 AA453830 


Hs.99339 


EST 


10.7 


? 


122616 AA453638 


Hs.161873 


ESTs 


107.3 


? 


122617 AI681535 


Hs.148135 


serine/threonine kinase 33 


121.4 


other 


122618 AA453641 




gb:zx48e06.s1 SoaresJestis.NHT Homo sap 


31.1 


SS. 


122622 AA453987 


HS.144802 


ESTs 


5.6 


other 


122717 AA456859 


Hs.178358 


ESTs 


8.5 


SS, 


122762 AI376875 


Hs-105119 


ESTs 


10.4 


other 


122829 AW204530 


Hs.99500 


ESTs 


81.8 


? 


122834 AA461492 


Hs.99545 


Homo sapiens cDNA FU10658 fis, clone NT 


3.7 


? 


122836 AA460581 


Hs.290996 


ESTs 


4.6 


other 


122837 AA461509 


Hs.293565 


ESTs, Weakly similar to putative p15Q (H 


17 


TM 


122838 M460584 


Hs.334386 


ESTs 


75.3 


other 


122854 AA600235 


Hs.9625 


NIMA (never in mitosis gene abated k 


7.8 


other 


122856 AI929374 


Hs.75367 


SroBke-adapter 


5.8 


other 


122861 AA335721 


Hs.1 19394 


ESTs 




other 


122866 BE539656 


Hs.283705 


ESTs 


42 


other 


122868 AF005216 


Hs.1 15541 


Janus kinase 2 (a protein tyrosine klnas 


5.3 


other 
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AW576312 Hs.318722 Homo sapiens cDNA: FLJ21766 fls, done C 

AW081394 Hs.97103 ESTs 

AA769410 Hs.128654 ESTs 

AA470074 Hs.169896 ESTs 

AA470140 Hs.229170 EST 

AA478951 Hs.105629 ESTs 

AW968324 Hs. 17384 ESTs 

AW338067 Hs.323231 Homo sapiens cONA FU1 1946 fis, done HE 

AL359571 Hs.44054 ninein (GSK3B interacting protein) 

AI382600 Hs.104308 ESTs, Weakly similarto K1AA1395 protein 

AA485360 Hs.105661 ESTs 

AI343652 Hs.105667 ESTs 

AA486256 Hs.193510 EST 

BE304942 Hs.265848 myomegalln 

T52027 Hs.271795 ESTs, Weakly similar to 138022 hypotheti 

A1061582 Hs.324179 Homo sapiens cDNA RJ12371 fis, done MA 

AW451999 Hs.194024 ESTs 

AI734179 Hs.105676 ESTs 

AW601773 Hs.270259 ESTs 

AA490929 Hs.105274 ESTs, Weakly similar to RMS1_HUMAN REGUL 

AA496369 gb:zv37d10.s1 Soares ovay tumor NfaHOT H 

AA504757 Hs.105738 ESTs 

AA731404 Hs.105510 ESTs 

AW450922 Hs.1 12478 ESTs 

AA599042 Hs.112503 EST 

AW303285 Hs.303632 Human DNA sequence from clone RP1 1-1 10H4 

AB021644 Hs.197219 zinc finger protein 14 (KOX 6) 

BE439553 Hs.250528 Homo sapiens, done IMAG£:4098694 ( mRNA, 

N95059 Hs.55098 ESTs 

BE019072 Hs.334802 Homo sapiens cONA FU1468Q fis, done NT 

AW380388 Hs.155546 KIAA1080 protein; Golgi-assodated, gamm 

AA609170 gb:af12a12.s1 SoaresJestisJWT Homo sap 

AA602964 g b:no97c02.s1 NCLCGAP_Pr2 Homo sapiens 

AA609364 gb:zu7ld09.s1 SoaresJestis_NHT Homo sap 

AI269609 Hs.105187 kinesin protein 9 gene 

NMJH3241 Hs.95231 FH1/FH2 domain-containing protein 

AA609891 Hs.1 12777 EST 

AA609955 Hs.234961 Huntingtin interacting protein E 

AA620464 Hs.261915 EST, Weakly similarto S65657 alpha-1C-a 

AA620586 gbae60g05.s1 Stratagene King carcinoma 

AB012922 Hs.173043 metastasis-associated 1 -like 1 

AJ272267" Hs.146178 choline dehydrogenase 

L42542 Hs.75447 ralA binding protein 1 

AH47155 Hs.270016 ESTs 

AI950314 Hs.1 54762 HIV-1 rev binding protein 2 

H05635 Hs.294030 topoisomerase-related function protein 4 

BE463721 Hs.97101 putative G protein-coupled receptor 

AA372796 Hs.269339 ESTs, Weakly similar to AF161356 1 HSPC0 

AA640891 Hs.102406 ESTs 

D87454 Hs.192966 KIAA0265 protein 

AI267847 gb:aq49a10J(1 Stanley Frontal NB pool 2 

AA317338 Hs.7535 COBW-like protein 

AF155099 Hs.279780 ■ NY-REN-18 antigen 

N34059 gb:yv28h09.s1 Soares fetal liver spleen 

H13540 Hs.82202 ribosomal protein L1 7 

AA532519 Hs.129043 Human DNA sequence from done 989H1 1 on 

R10084 Hs.113319 kinesin heavy chain member 2 

N53935 gb:yv59d09,s1 Soares fetal liver spleen 

H79433 Hs.268997 ESTs 

AA669097 Hs.109370 ESTs 

N71076 Hs.102800 ESTs, Weakly similar to neuronal thread 

NMJ14053 Hs.270594 FLVCR protein 

AI765123 Hs.143671 Homo sapiens cDNA FLJ13533 (is, clone PL 

AA160474 Hs.75798 hypothetical protein 

AW968856 Hs.278569 sorting nexin 17 

N92593 Hs.313054 ESTs 

R48170 Hs.78436 EphB1 
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124683 AA381661 


Hs.1 19878 


ESTs, Weakly similar to M3K9.HUMAN MITOG 


7.9 


TM 


124712 R09156 


Hs.191148 


ESTs 


5.7 


other 


124735 R22952 


Hs.268685 


ESTs 


11.3 


? 


124761 AA374756 


Hs.93560 


Homo sapiens mRNA for KIAA1771 protein, 


9 


other 


124768 AW368528 


Hs.100855 


ESTs 


8.3 


other 


124775 R41772 


Hs.100878 


ESTs 


4.9 


other 


124777 R41933 


Hs.140237 


ESTs, WeaWy similar to ALU1_HUMAN ALU S 


2.8 


other 


124788 R43543 


Hs.100912 


Homo sapiens cDNA: FU22726 fis, clone H 


5.1 


other 


124809 AL355722 


Hs.106875 


Homo sapiens EST from done 35214, full 


4.2 


other 


124811 R46068 


Hs.283912 


hypothetical protein FU22604 


14.2 


other 


124812 R47948 


Hs.188732 


ESTs 


7.9 


other 


124822 AA418160 


Hs.86043 


Homo sapiens cDNA FU13558 fis, clone PL 


6.6 


other 


124825 AA501669 


Hs.336693 


ESTs 


2.3 


SS.TM 


124833 AW975868 


Hs.294100 


ESTs 


2.7 


SS.TM 


124857 R63652 


Hs.137190 


ESTs 


2.3 


other 


124860 R65763 


Hs.101477 


EST 


23.9 


? 


124863 AI382555 


Hs.127950 


bromodomain-containing 1 


2 


other 


124876 AF135422 


Hs.27059 


GDP-mannose pyrophosphoryiase A 


4.4 


SS, 


124878 BE397530 


Hs.288057 


hypothetical protein FU22242 


2.7 


other 


124902 H37941 


Hs.101883 


ESTs 


5.7 


other 


124903 AW296713 


Hs.221441 


ESTs 


32.4 


other 


124930 AI076343 


Hs.173939 


ESTs, Weakly similar to ALUB_HUMAN III! 


22.8 


other 


124942 R99978 


Hs.268892 


ESTs, Moderately similar to B34087 hypot 


6.1 


other 


124958 AI078645 


Hs.431 


murine leukemia viral (bmi-1) oncogene h 


1.9 


other 


124980 T40841 


Hs.98681 


ESTs 


4,5 


? 


125002 T59338 


Hs.269463 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


4.9 


other 


125047 T79815 


Hs.279793 


ESTs 


5 


? 


125051 T79956 


Hs.100588 


EST 


135.3 


? 


125056 T81310 


Hs.100592 


ESTs 


5.4 


other 


125101 AI472068 


Hs.286236 


KIAA1856 protein 


5.6 


other 


125113 T96595 


Hs.302270 


ESTs, Weakly similar to ALUF_HUMAN 111! 


1.8 


other 


125115 T97341 




gb:ye57e05.s1 Scares fetal liver spleen 


9.6 


? 


125125 AI222382 


Hs.240767 


Human DNA sequence from clone RP1-12G14 


1.5 


TM 


125147 W38150 




Empirically selected from AFFX single pr 


1.7 


? 


125161 W44657 


Hs.144232 


EST 


10.7 


? 


125249 AA630863 


Hs.131375 


ESTs, Moderately similar to ALUBJiUMAN I 


1.3 


other 


125255 AF098162 


Hs,118631 


timeless (Drosophfla) homolog 


9.4 


other 


125279 AW401809 


Hs.4779 


K1AA1150 protein 


1.5 


7 


125280 AI123705 


Hs.106932 


ESTs 


8.1 


? 


125298 AW972542 


Hs.269008 


Homo sapiens cDNA: FU21814 fis, done H 


1.5 


other 


125660 AW292171 


Hs.23978 


scaffold attachment factor B 


5.9 


other 


125827 NM.003403 


Hs.97496 


YY1 transcription fador 


1.2 


? 


125891 U29589 


Hs.7138 


cholinergic receptor, muscarinic 3 


6.5 


? 


126005 AW409701 


Hs.1578 


baculoviral IAP repeat-containing 5 (sur 


14.3 


? 


126202 AA157632 


Hs.272630 


vacuolar proton pump delta polypeptide s 


2.5 


SS, 


126695 AA643322 


Hs.172028 


a dislntegrin and metalloprotelnase coma 


9.1 


SS.TM 


127050 AW411066 


Hs.274351 


CGI4J9 protein 


17 


other 


127274 AW966158 


Hs.58582 


Homo sapiens cDNA FLJ12789 fis, clone NT 


12.8 


other 


128355 AW293012 


Hs.161623 


ESTs 


7.4 


SS, 


128493 D87466 


Hs.240112 


K1AA0276 protein 


3.1 


TM 


128522 BE173977 


Hs.10098 


putative nucleolar RNA helicase 


9.4 


other 


128527 AA504583 


Hs.101047 


transcription fador 3 (E2A Immunoglobul 


1.5 


other 


128528 R39234 


Hs.251699 


ESTs. Weakly similar to IDN4-GGTR14 [H.s 


2.8 


other 


128595 U31875 


Hs.272499 


short-chain aicohol dehydrogenase family 


12.1 


TM 


128599 NM.015366 


Hs.102336 


Rho GTPase activating protein 8 


2.4 


? 


128604 AI879099 


Hs.102397 


GIOT-3 for gonadotropin inducible transc 


1.3 


other 


128608 BE267994 


Hs.102419 


zinc finger protein 


12 


other 


128625 AB037641 


Hs.102652 


hypofoeflcal protein ASH1 


1.3 


other 


128629 AL096748 


Hs.102708 


DKFZP434A043 protein 


32 


other 


128639 AW582962 


Hs.102897 


CGW7 protein 


2 


TM 


128656 AA458542 


Hs.10326 


coatomer protein complex, subunit epsilo 


1.4 


other 


128658 BE397354 


Hs.324830 


diptheria toxin resistance protein requl 


Z5 


other 


128670 AA975486 


Hs. 103441 


Homo sapiens, Similar to RIKEN cDNA 1700 


7.1 


? 


128691 W27939 


Hs.103834 


hypothetical protein MGC5576 


7.8 


? 


12B696 BE081143 


Hs.225977 


nudear receptor coactivator 3 


3.8 


other 


128700 Y15221 


Hs.103982 


smaQ inducible cytokine subfamily B (Cy 


1.6 


other 


128714 T85231 


Hs.179661 


tubulin, beta 5 


7.8 


other 
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AK001564 
BE147740 
AF292100 
AA307211 
AI470163 
AB027249 
BE302796 
N71826 
NMJ02975 
AW630942 
AW248431 
BE281170 
AK001731 
BE159181 
AF189723 
R57988 
AA622037 
R67419 
Y13153 
AA009647 
AI580127 
AW150697 
AI375672 
BE560779 
AW271217 
AI816224 
AI950087 
AUM4675 
R80088 
AW296806 
AI351010 
AA744610 
L12350 
AA463189 
8E243933 
AF146074 
W93048 
AA356620 
AW162916 
AA286914 
AA150797 
N57532 
AI934365 
U40714 
AF013758 
AA252468 
AI961727 
W26392 
A1051967 
AA287239 
H75334 
BE614192 
U30246 
BE220806 
AI686379 
NM.016039 
AF149785 
AI267700 
AA204686 
M188185 
AW843633 
AF255303 
AA769221 
W01296 
AA317841 
A1923097 
F0B282 



Hs.104222 hypothetical protein FU10702 

Hs. 104558 ESTs, Moderately similar to I38022 hypot 

Hs.1 0461 3 RP42homo!og 

Hs.251531 proteasome (prosome, macropain) subunit, 

Hs.323342 actin related protein 2/3 complex, subun 

Hs.104741 PDZ-blnding kinase; T-cell originated pr 

Hs.105097 thymidine kinase 1, soluble 

Hs.105465 small nuclear ribonucieoprotein polypept 

Hs. 1 05927 stem cell growth fecton lymphocyte seer 

Hs.106061 RDRNA-blnding protein 

Hs.256526 nuclear prelamin A recognition factor 

Hs. 1 06357 vatosin-containing protein 

Hs.106390 Homo sapiens mRNA; cONA DKF2fc586H0924 (f 

Hs.168232 hypothetical protein FU13855 

Hs.106778 ATPase, Ca++ transporting, type 2C, memb 

Hs.10706 epithelial protein tost in neoplasm beta 

Hs.166468 programmed cell dealh 5 

Hs.21851 Homo sapiens cDNA FU12900 fis, clone NT 

Hs.107318 kynurenine 3-monooxygenase (kynurenine 3 

Hs.8850 a dlsintegrin and metaltoproteinase doma 

Hs.107381 hypothetical protein FU1 1200 

Hs.107418 ESTs 

Hs.165028 ESTs 

Hs.284233 NiCE-5 protein 

Hs.281434 Homo sapiens cONAFU14028 fis, done HE 

Hs.107747 DKFZP566C243 protein 

gtewq05c02j(1 NCI_CGAPJ<id12Homosapien 

Hs.173081 KIAA0530 protein 

Hs.108104 ubiqultin-conjugating enzyme E2L 3 

Hs.326234 ESTs, Highly similar to T46422 hypotheti 

Hs.102267 lysosomal 

Hs.194431 paliadin 

Hs.108623 thrombospnndin2 

Hs.288906 WW Domain-Containing Gene 

Hs.108642 zinc finger, protein 22 (KOX 15) 

Hs.108660 ATP-binding cassette, sub-family C (CFTR 

Hs.250723 hypothetical protein MGC2747 

Hs.108947 K1AA0050 gene product 

Hs.241576 hypothetical protein PR02577 

Hs.183299 ESTs 

Hs.109276 latexin protein 

Hs.109315 KIAA1415 protein 

Hs.109439 osteoglydn (osteoinductive factor, mime 

Hs.239307 tyrosyl-IRNA synthetase 

Hs.109643 potyadenylate binding proteln-interactin 

Hs.1098 DKFZp434J1813 protein 

Hs.109804 H1 histone family, member X 

Hs.110080 ESTs, Weakly similar to S13495 pregnancy 

Hs,1 10122 ESTs 

Hs.5518 Homo sapiens cONA FU11311 fts, done PL 

Hs.11050 F-box only protein 9 

Hs.279869 melanoma-associated antigen recognised b 

Hs.1 10736 solute carrier family 12 (sodium/potassl 

Hs.184697 Homo sapiens done 23785 mRNA sequence 

Hs.1 10796 SAR1 protein 

Hs.1 10803 CGi-99 protein 

Hs.1 1 1 126 pituitary tumor-transforming 1 interact! 

Hs.317584 ESTs 

Hs.234149 hypothetical protein FU20647 

Hs.289Q43 spindiin 

Hs.306163 hypothetical protein AL1 101 1 5 

Hs.1 12227 membrane^assodated nudeic acid binding 

Hs.270847 delta-tubulin 

Hs.1 1360 hypothetical protein FU14784 

Hs.7845 hypothetical protein MGC2752 

Hs.1 1441 chromosome 1 open reading frame 8 

Hs.278428 progestin Induced protein 
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129587 H14718 


Hs.11506 


Human clone 23589 mRNA sequence 


6.8 


other 


129588 BE408300 


Hs.301862 


postmeiofic segregafion increased 2-UKe 


1.4 


TM 


129591 N57423 


Hs.179898 


HSPC055 protein 


7.4 


other 


129594 AW403724 


Hs.36989 


coagulation factor VII (serum prothromb! 


9 


? 


129596 AF035537 


Hs.1 15521 


REV3 (yeast homologHike, catalytic sub 


1.6 


other 


129628 U38945 


Hs.1174 


cycfin-dependent kinase inhibitor 2A (me 


2.2 


other 


129649 AD000092 


Hs.16488 


calretfcuBn 


3.3 


other 


129675 NM.015556 


Hs.172180 


K1AA0440 protein 


13.4 


other 


129680 U03749 




gb:Human chromogranln A (CHGA) gene, pro 


14.1 


? 


129689 AW748482 


Hs.77873 


B7homoiog3 


2.6 


other 


129702 AI304966 


Hs.12035 


ESTs, Weakly simHar to I38022 hypotheti 


7.5 


TM 


129720 AA156214 


Hs.12152 


APMCF1 protein 


2 . 


other 


129721 NMJJ01415 


Hs.211539 


eukaryoiic translation initiation factor 


1.7 


TM 


129778 AK001676 


Hs.12457 


hypothetical protein FU1Q814 


1.8 


other 


129779 AA394090 


Hs.12460 


Homo sapiens clone 23870 mRNA sequence 
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WAA0431 protein 

ATP-bhding cassette, sub-family A (ABC1 

desmoplakin(DPI,DPIl) 

ESTs, Moderately similar to A46010 X4in 

TBX3-feo protein 

Nijmegen breakage syndrome 1 (nibrin) 
nucleobindin 2 
DKFZP586J0119 protein 
ESTs, Highly similar to IRX1_HUMAN JROQU 
upstream regulatory element binding prot 
SW1/SNF related, matrix associated, adi 
COP9 complex subunit 7a 
mitochondrial ribosomal protein L37 

l - ! -,29kD 



K1AA0781 protein 
heterogeneous nuclear ribonucleoprotein 
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132528 T78736 Hs.50758 SMC4 (structural maintenanoe of chromoso 7.4 ? 

132571 AW674699 Hs.5169 suppressor of G2 allele of SKP1, S. cere 6.9 other 

132726 N52298 Hs.55608 hypothetical protein MGC955 14,3 ? 

132863 BE268Q48 Hs.236494 RA810, member RAS oncogene family 10.3 other 

5 133016 AM39688 Hs.6289 hypothetical protein FU20886 4.4 other 

133053 AI065016 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, 1.8 SS.TM 

133197 AI275243 Hs.180201 hypoftetical protein FU20671 1.8 other 

133240 AK001489 Hs.242894 ADP-ribosyiaGon factor-like 1 1.8 other 

133266 AI160873 Hs.69233 zinc finger protein 16.1 other 

10 133285 M76477 Hs.289082 GM2ganglk>side activator protein 10.4 SS, 

133383 BE313555 Hs.7252 KIAA1 224 protein 1.5 7 

133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subu 1.7 other 

133784 BE622743 Hs.301064 arfaptin 1 1Z1 other 

133791 M34338 Hs.76244 spermidine synthase 9.7 other 

15 133850 W29092 Hs.7678 ceDular retinoic acid-binding protein 1 42 SS, 

133859 U86782 Hs.178761 26S proteasome-associated pad1 homolog 22 other 

133881 U30872 Hs.77204 centromere protein F (350/400kD, mitosin 9.1 other 

134208 NM_000288 Hs.79993 peroxisomal biogenesis factor 7 Z2 other 

134403 AA334551 Hs.82767 sperm specific antigen 2 1.4 other 

20 134724 AF045239 Hs.321576 ring finger protein 22 1.4 other 

134806 AD001528 Hs.89718 spermine synthase 2.6 other 

134859 D26488 Hs.90315 KIAA0007 protein 13.3 other 

135193 X95525 Hs.96103 TATA box binding protein (TBP)-associate 3.1 other 

AA243007 ESTs 1.6 ? 

25 n0541 ESTs 2.5 SS ( 

X57766 Human stromelysin-3 mRNA 4.5 other 

S66431 Homo sapiens clone 23592 mRNA sequence 3.1 other 

AA453483 ESTs 4.6 TM 

R63925 ESTs 1.4 other 

30 AA173417 ESTs 1.9 other 

AA280588 ESTs 2.2 other 

AA504223 ESTs Highly similar to CHROMOSOME Z4 other 

AA609996 ESTs Highly similar to Surf-4 protein (M.musculusl 5.5 ? 

F02907 ESTs 2.3 TM 

35 AA480103 ESTs Weakly similar to ill! ALU SUBFAMILY J 2.8 TM 

AA024664 Human NADK'ubiqumone oxkJoreductase subunit 6.2 other 

AA251776 ESTs 2.3 other 

AA399047 ESTs 2.4 other 

N34059 EST-RCJJ34059 3.3 other 

40 U95367 Human GABA-A receptor pi subunit mRNA complete cds 1.7 

AA490899 - ESTs 3.3 other 

T54762 ESTs 2.9 ? 

Z41953 Homo sapiens HP protein (HP) mRNA complete cds 1.3 ? 

AA521186 ESTs 1.6 TM 

45 AA400195 ESTs 1.3 other 

AA045083 VITAMIN K-DEPENDENT GAMMA-CARBOXYLASE 2.5 

AA099589 Homo sapiens mRNA for GDP dissociation inhibitor beta 1.6 

W85712 ESTs Weakly similar to PROCOLLAGEN ALPHA 2(1V 2.6 

W45728 ESTs Highly similar to HETEROGENEOUS 3.7 other 

50 U61232 Human tubuIin-foldingcofactorE mRNA complete cds 2.1 

AA425154 ESTs 5.3 other 

T39176 ESTsWeaWysimaartoZK1058.4[C.elegans] 2.6 SSJM 

AA496000 ESTs 1.9 SS, 

W38150 EST-RCJW38150 17 ? 

55 T96595 EST-RC_T96595 1.8 TM 

AA227463 ESTs Weakly similar to No definition line found IC.elegans] 1 .9 

R46025 ESTs 2.8 SS, 

AA233177 ESTs 2 other 

AA338760 ESTs 1.3 ? 

60 AA412106 ESTs * 6.2 other 

L47276 EST-L47276 3.4 other 

D82307 ESTs Weakly similar to TH1 protein [D.melanogaster] 11.4 

AA293568 ESTs 1.5 other 

R37778 ESTs 2.4 other 

65 AA250843 Interferon regulatory factors 14.6 ? 

W49521 Human prolyl 4-hydroxylase alpha (II) subunit 6.5 ? 

D80000 Human mRNA for KIAA0178 gene partial cds 2 other 
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R99978 


ESTs Weakly similar to Bne-1 protein 0RF2 [H.saptens] 


0.1 


AA1 95036 


Human Ro/SSA ribonudeoprotein homolog (RoRet 5.3 


{ 


Z38501 


ESTs Weakly similar to PROBABLE E5 


1.4 


other 


U37547 


Human IAP homolog B (MJHB) mRNA complete cds 3.2 


other 




ESTs 


17 


other 


VC7C7Q 


Inhlbin beta A (aciivin A activln AB alpha polypeptide) 


1 0.0 


AA449071 


ESTs 


1.3 


1 IV) 


N51855 


ESTs Moderately similar to NAD(+) ADP- 


1.3 


other 


AA421213 


ESTs Weakly similar to F28F8.3 (C.elegans] 


3.2 


other 


AA355201 


ESTs 


1.2 


SS.TM 


N78717 


H^apjens mRNA fortranslin 


15 


? 


N73808 


ESTs 


5 


? 


U86782 


Human 26S proteasome-assoclated pad1 


2.2 


other 


AA234817 


ESTs 


1.3 


other 


D13S66 


Homo sapiens mRNA for osteoblast specific 


7.5 


SS, 


AA236177 


ESTs 


7.1 


? 


U50648 


Protein kinase interferon-inducible double 


4.1 


? 


M28211 


Homo sapiens GTP-binding protein (RAB4) 


2.9 


other 


AA446949 


ESTs 


2.2 


other 


W03007 


ESTs 


1.2 


other 


W61011 


ESTs 


1.2 


other 


W87544 


ESTs 


1.2 


other 


X02751 


Neuroblastoma RAS viral (v-ras) oncogene homolog 1.2 


? 


Z14077 


YY1 transcription factor 


U 


other 


238839 


ESTs 


1.2 


? 


AA410894 


ESTs 


1.7 


other 


AA504499 


ESTs Highly similar to probable chloride channel 3 fH.sap 


1.3 



older 
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TABLE 7A 
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10 
15 
20 



Table 7 A shows the accession numbers for those pkeys lacking unigenelD's for Table 7. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

102481 31281.-28 U50360 

105032 genbanleAA127818 AA127818 

409487 1134778.1 H19886 AW402806 T10231 
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TABLE 8: Figure 8 from BRCA 001-1 US 

Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue. 
Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiD: Undone number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 

R2: Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigeneiD Unigene Title 



100075 
100229 
100262 
100271 
100355 
100522 
100552 
100599 
100676 
100690 
100895 
101046 
101086 
101148 
101161 
101201 
101212 
101441 
101447 
101469 
101567 
101600 
101624 
101674 
101861 
101977 
102193 
102199 
102304 
102345 
102457 
102534 
102541 
102827 
102962 
102991 
103119 
103175 



103319 
103372 
103419 
103471 
103546 



AF152333 

AV652249 

D38500 

BE160081 

AI907114 

X51501 

AA019521 

X77343 

X02761 

AA383256 

U01351 

K01 160 

AA382524 

NMJJ02923 

NMJJ06262 

L22524 

AI186220 

AW468397 

M21305 

AA310162 

M33552 

BE561617 

M55998 

NMJ502291 

AA350659 

AF112213 

AL036335 

AA334592 

AF015224 

NM_003480 

NM 001394 

U96759 

AI379954 

BE244588 

R50O32 

AW293542 

X63629 

X69089 

D38616 

X83492 

BE536700 

T34708 

Y00815 

Z14244 



Hs.284160 

Hs.180107 

Hs.278468 

Hs.256290 

Hs.71465 

Hs.99949 

Hs.301946 

Hs.334334 

Hs.287820 

Hs.1657 

Hs.75772 

Hs.250959 

Hs.78944 

Hs.37044 

Hs.2256 

Hs.83164 

Hs.100000 

Hs.169248 
Hs.56729 
Hs.1 19192 

Hs.82124 

Hs.83347 

Hs.184062 

Hs.313 

Hs.79914 

Hs.46452 

Hs.300946 

Hs.2359 

Hs.198307 

Hs79025 

Hs.6456 

Hs.159263 

Hs.75309 

Hs.2877 

Hs.79227 

Hs.54941 

Hs.82359 

Hs.4888 

Hs.272927 

Hs.75216 

Hs.75752 



protocadherln gamma subfamily B, 4 
polymerase (DNA directed), beta 
posimelotic segregation increased 2-Jike 
S100 calcium-binding protein A1 1 {calgiz 
squalene epoxidase 
prolactin-induced protein 
lysosomal 

transcription factor AP-2 alpha (activat 
fibronectin 1 
estrogen receptor 1 

nuclear receptor subfamily 3, group C, m 
NMJX)2122:Homo sapiens major histocompatl.7 
histatinl 

regulator of G-proteln signalling 2, 24k 
peripherin 

matrix metalloproteinase 7 (MMP7; uterin 
collagen, type XV, alpha 1 
S100 calcium-binding protein A8 (calgran 
gb:Human alpha satellite and satellite 3 
cytochrome c 
lysosomal 

H2A histone family, member Z 
gb:Human alpha-1 collagen type 1 gene, 3 
laminin, beta 1 

anglo-assoclated, migratory cell protein 



secreted phosphopratein 1 (osteopontin, 

lumfcan 

mammaglobln 1 

Microfibril-associated glycoprotein-2 
dual specificity phosphatase 4 
von Hippei-Undau binding protein 1 
K1AA0096 protein 

chaperonin containing TCP1, subunit 2 (b 
collagen, type VI, alpha 2 
eukaryoCc translation elongation factor 
cadherin 3, type 1, P-cadherin (placenta 
myomesin (M-protein) 2{165kD) 
phosphorylase kinase, alpha 2 (fiver) 
tumor necrosis factor receptor superfaml 
seryl-tRNA synthetase 
Sec23 (S. cerevislae) homolog A 
protein tyrosine phosphatase, receptor t 
cytochrome c oxidase subunit Vllb 
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R1 


R2 


1 


3.8 


1.7 


5.3 


0.8 


4.8 


3.2 


2.3 


3.3 


1.4 


11.9 


0.4 


3.8 


1.2 


9.4 


9.4 


3 


7.8 


4.4 


4.4 


1 


3.9 


tt.7 


4 


0.8 


4.1 


1.2 


12 


3.1 


1.1 


4.4 


0.6 


3.1 


3.4 


0.9 


4.2 


29.9 


0.3 


0.8 


4.9 


1 


5.9 


2.8 


4 


3.1 


1.7 


1.5 


4.1 


3.1 


1.4 


1.3 


6.9 


1.9 


4.9 


2.2 


3.8 


4.2 


0.7 


1.1 


4.2 


4.5 


0.5 


1.4 


4.2 


0.9 


3.9 


1.5 


10.9 


2.2 


6.2 


5.6 


5.7 


3.7 


0.5 


1.3 


4 


1.3 


3.8 


0.8 


4.6 


0.9 


8 


1.1 


5.1 


3.7 


1.2 


0.9 


4.4 
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103658 NM.000088 


Hs.172928 collagen, type I, alpha 1 


3.2 


3 


103758 AA084874 


gb:zn13e04.r1 Stratagene hNT neuron (937 


0.9 


10 


103774 H24185 


Hs.92918 hypothetical protein 


1.9 


15.9 


103821 M095971 


Hs.198793 Homo sapiens cDNA: FU22463 fe, clone H 


1.2 


3.9 


103869 BE439604 


Hs.24322 ATPase, H+ transporting, lysosomal {vacu 


1.4 


3.9 


103980 AW130242 


Hs.293476 hypothetical protein FKSG44 


1.6 


4.1 


104054 AK001913 


Hs.7100 hypothetical protein 


1.5 


4.3 


104115 AF183810 


Hs.26102 opposite strand to trichomlnophalangeal 


7 


7 


104189 AB040927 


Hs.301804 KIM1494 protein 


2 


4.6 


104230 AB002347 


Hs.15303 K1AA0349 protein 


0.7 


4.5 


104278 AW583693 


Hs. 109253 N-terminal acetyltransferase complex ard 


3.3 


3.3 


104295 AW365522 


Hs.103657 hypothetical protein PR02219 


2.3 


4.2 


104319 AW804296 


Hs.9950 Sec61 gamma 


3.1 , 


7 


104425 AF283775 


Hs.35380 x 001 protein 


4 


1.3 


104432 X51501 


Hs.99949 pralactin-induced protein 


3.8 


0.6 


104464 AW9fiB72fl 


Hs.54642 methionine adenosyltransferase II, beta 


0.8 


67 


104479 AK001731 


Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (f 


1.7 


104505 W94824 


Hs.11565 RIKENcDNA 2010100012 gene 


2 


7.5 


104592 AW630488 


Hs.325820 protease, serine, 23 


1.9 


7.4 


104613 API 23303 


Hs.24713 hypothetical protein 


1.1 


6.3 


104636 R82252 


Hs.106106 protein kinase (cAMP-dependent, catalyfi 


1.2 


4 


104782 AW2705S5 


Hs.171774 hypothetical protein 


1.4 


3.9 


104792 AA960961 


Hs.305953 zinc finger protein 83 (HPF1 ) 


t.5 


A2 


104848 AA305351 


Hs.274369 uncharacterized hypothalamus protein HAR 


1.1 


4.1 


104849 AI279065 


Hs.241507 ribosomal protein S6 


1.3 


4.6 


104850 AL133035 


Hs.8728 hypothetical protein DKFZp434G171 


12 


3.6 


104852 W70164 


Hs.20107 ESTs 


0.8 


4.2 


104861 AA058630 


Hs.29759 RNA POLYMERASE I AND TRANSCRIPT RELEASE 


1.7 


104873 W03831 


Hs.20597 host ceH factor homobg 


0.8 


5.4 


104891 W44626 


Hs.30627 ESTs 


0.7 


6.8 


104920 AW955089 


Hs.306083 Novel human gene mapping to chomosome 22 


1 3.9 


104926 BE298808 


Hs.33363 DKFZP434N093 protein 


3.3 


3.3 


104952 AW076098 


Hs.74316 desmoptekinpPl.DPU) 


1.2 


3.7 


104963 AB029020 


Hs.173694 KIAA1097 protein 


1.1 


5.5 


104977 A1392640 


Hs.18272 amino acid transporter system A1 


3.2 


14 


105030 BE613061 


Hs.337772 Homo sapiens, Similar to RIKEN cONA 0610 1.6 


11.4 


105035 N39760 


Hs.8859 Homo sapiens. Similar to RIKENcDNA 5830 1.5 


7.2 


105068 BE410438 


Hs.9006 VAMP (veside-associated membrane pratei 


1.1 


3.5 


105159 AF146277 


Hs.265561 CD2-assoclated protein 


1.2 


10 


105178 AA313825 


Hs.21941 AD036 protein 


3.6 


8.3 


105182 BE407961 


Hs. 1 8271 golgi phosphoprotein 3 


1.7 


6.8 


105274 AI554929 


Hs.281866 ATPase, H+ transporting, rysosomal (vacu 


1.1 


3.7 


105303 BE243327 


Hs.182626 chromosome 22 open reading frame 5 


1.5 


4 


105413 A1015709 


Hs.172089 Homo sapiens mRNA; cDNA DKFZp586l2022 (f 


1.5 


105426 W20027 


Hs.23439 ESTs 


4.3 


2.9 


105432 W03516 


Hs.76698 stress-associated endoplasmic reticulum 


1.5 


5 


105443 AA252372 


Hs.12144 KIAA1033 protein 


1.2 


3.6 


105483 AL1 37257 


Hs.23458 Homo sapiens cDNA: FU23015 fe, clone L 


1.7 


15.8 


105492 AI805717 


Hs.289112 CGI-43 protein 


2 


4.8 


105495 AL037715 


Hs.28785 microfibriitar-associated protein 3 


1.3 


3.9 


105539 AB040884 


Hs.109694 WAA1451 protein 


27 


114 


105594 AB024334 


Hs^5001 tyrosine 3-monooxygenase/tryptophan 5-mo 


1.3 


6.1 


105623 BE504200 


Hs.30127 hypothetical protein 


1.7 


4.5 


105807 AA788946 


Hs.16869 ESTs, ModeratelysimilartoCAICRAT COL 


3.9 


24.6 


105812 BE614149 


Hs.20814 CGI-27 protein 


1.8 


3.6 


105823 AI559444 


Hs.293960 ESTs 


1.9 


6.6 


105831 AA329449 


Hs.247302 twisted gastrulation 


1.5 


4.3 


105851 AI827976 


Hs.24391 hypothetical protein RJ13612 


3.B 


19 


105879 BE392914 


Hs.30503 Homo sapiens cONAFUl 1344 fe, clone PL 


1.7 


4 


105918 AW028485 


Hs.26136 hypothetical protein MGC14156 


1.7 


7.4 


105939 AL137728 


Hs.12258 Homo sapiens mRNA; cONA DKFZp434B0920 (f 


12 


105941 AB033075 


Hs. 10669 development and differentiation enhancin 


1.3 


4.6 


105969 AB030656 


Hs.17377 coronin, acfin-binding protein, 1C 


1.1 


5.9 


105990 AI6905B5 


Hs.29403 hypothetical protein FU2206O 


2 


4.6 


106012 AI240665 


Hs.8895 ESTs 


4.1 


\2 


106060 NM-001329 


Hs.171391 C-terminal binding protein 2 


2.6 


7 


106070 T74445 


Hs.5957 Homo sapiens clone 24416 mRNA sequence 1.4 


10.7 



4.8 



5.1 



14 



3.8 
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106083 
106155 
106255 
106414 
106538 
106568 
106574 
106613 
106617 
106619 
106701 
106721 
106776 
106866 
106868 
106887 
106940 
106968 
107052 
107061 
107149 
107222 
107233 
107295 
107679 
107914 
107965 
108033 
108060 
108081 
108137 
108186 
108215 
108297 
108339 
108371 
108399 
108469 
108470 
108564 
108641 
108668 



108824 
108863 
108893 
108992 
109072 
109097 
109160 
109244 
109481 
109484 
109795 
110009 
110107 
110411 
110731 
110756 
110930 
110935 
111051 
111110 
111356 
111357 
111770 
111900 



H62087 

AA425414 

8E613206 

BE568205 

AK000274 

AW051564 

BE044325 

N88604 

H09548 

AA459480 

BE387614 

AA741038 

AA206079 

AA487416 

BE185536 

BE503373 

T85594 

AF216751 

BE391904 

BE147611 

AI289507 

BE172058 

BE267795 

AA186629 

AA011510 

AA027229 

AF109219 

AW368993 

AA291440 

AA093668 

AI283611 

AW068579 

A879238 

AA333660 

AW151340 

AA074374 

AF086070 

AA079487 

AA079500 

M23114 

AA112059 

M058522 

AA036725 

AK001332 

AA133456 

BE276891 

AA152312 

AI732585 

AA167512 

BE220601 

BE179030 

AA878923 

AA366263 

AA173942 

BE075297 

AW151660 

AW001579 

NM.014899 

N21207 

BE242691 

AI753230 

AI681293 

AK001566 

BE301871 

BE314949 

R27975 

AF131784 



Hs.31659 

Hs.33287 

Hs.279607 

Hs.28827 

Hs.278635 

Hs.28285 

Hs.227280 

Hs.30212 

Hs.5367 

Hs.23956 

Hs.25797 

Hs.6670 

Hs.6693 

Hs.268231 

Hs.301183 

Hs.334335 

Hs.339808 

Hs.26813 

Hs.12482 

Hs.6354 

Hs.299883 

Hs.82689 

Hs.22595 

Hs.80120 

Hs.60512 

Hs.61329 

Hs.108787 

Hs.323748 

Hs.73149 

Hs.28578 

Hs.263479 

Hs.7780 

Hs.299315 

Hs.71331 

Hs.51615 

Hs.67639 

Hs.237519 



Hs.1526 

Hs.429 

Hs.185751 

Hs.61847 

Hs.44672 

Hs.102548 

Hs.194691 

Hs.72047 

Hs.22394 

Hs,301997 
Hs.64239 



1.5 
5.4 
1.8 
5.1 
1.2 
1.8 
2.3 
1.2 
0.9 
1.3 
1.6 
1.7 
1 

1.6 
3.3 
1.4 
3.3 
3 

1.7 
\2 
1.8 
1.2 
1.4 
2.6 
1.8 
1.3 
1.6 



thyroid hormone receptor-associated prot 
nuclear factor I/B 
calpastatin 

mitogen-adivated protein kinase kinase 
HOCMA18P protein 
patched related protein translocated In 
U6 snRNA-assoclated Sm-Bke protein 
thyroid receptor interacting protein 15 
ESTs, WeaWy similar to 138022 hypotheti 
hypothetical protein RJ20502 
splicing factor 3b, subunlt 4, 49kD 
ESTs 

hypothetical protein FU20420 
Homo sapiens cDNA: FU2311 1 lis, clone I 
molecule possessing ankyrin repeats lndu 
hypothetical protein FU13576 
hypothetical protein FU10120 
CDA14 

glyceronephosphate O-acyltransferase 
stromal cell derived factor receptor 1 
hypothetical protein FLJ23399 
tumor rejection antigen (gp96) 1 
hypothetical protein FU10637 
UDP-N-acetyl-alpha-D-galactosamine:polyp 
ESTs 

ESTs, Weakly similar to T16370 hypotheti 
phosphatidyiinositol glycan, class N 
Homo sapiens clone CDABP0086 mRNA sequen 
paired box gene 8 1.1 
muscleblind (Drosophiia)-like 0.7 
ESTs, Weakly similar to HMG1JUJMAN HIGH 
Homo sapiens mRNA; cDNA DKFZp564A072 (fr 
ccBapsin response mediator protein-5; C 1 ,5 
hypothetical protein MGC5350 1.5 
ESTs, Weakly similar to ALU7_HUMAN ALU S 
ESTs 1-3 
EST 1 
gbzm97f08.s1 Stratagene colon HT29 (937 1 .5 
gb:zm96h10.s1 Stratagene colon HT29 {937 
ATPase, Ca++ transporting, cardiac muscl 
ATP synthase, H+ transporting, mltochond 
ESTs 
ESTs 

hypothetlca! protein FU10470 
glucocorticoid receptor DNA binding fact 
retinoteackJ induced 3 
ESTs 

hypothetical protein RJ10893 
gfczp10f12.s1 Stratagene fetal retina 93 
hypothetical protein FU 13033 
Human DNA sequence from clone RP5-1 174N9 
hypothetical protein FLJ21Q16 3.8 
Hs.72531 hypothetical protein RJ11838 1.9 
Hs.326416 Homo sapiens mRNA; cDNA DKFZp564H1916 (f 
Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p 4.6 
Hs.31444 ESTs 

Hs.9645 Homo sapiens mRNA for KIAA1 741 protein, 
Hs.188006 KIAA0878 protein 
Hs.182999 ESTs 
Hs.14947 ESTs 

Hs.323562 hypothetical protein DKFZp564K142 
Hs.121 86 hypothetical protein FU22558 
Hs.23618 hypofoetical protein FU10704 
Hs.4867 mannosyl (atpha-1 .^-glycoprotein beta- 
Hs.87128 hypotheUcal prated FU23309 
Hs.269401 ESTs, Moderately similar to S65657 alpha 
Hs.25318 " n "'" M 



1.1 

2 

1.1 
U 
1.4 
1.4 
1.2 
1.3 
1.1 
1.2 
1.3 
4 



1.2 
3.7 
2.8 
1.6 
3.1 
1.9 
2 

1.1 
1 

3.3 
U 



Homo sapiens clone 25194 mRNA sequence 3.2 
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3.6 

I. 2 
4 

6.1 
5.9 
5.4 

II. 2 
3.6 
4.4 
3.6 
7.3 
6.1 
5.4 
5.4 
1.2 
6.3 
1.8 
3 

7.6 
4.3 
6.5 
6.9 
3.5 
4.3 
4 

3.5 
3.5 
1.8 
3.5 
5.6 
1.2 
3.1 
4.6 
4 

6.3 
3.8 
3.6 
3.6 
4.3 
4.9 
3.5 
3.6 
3.6 
3.5 
4 

3.6 
4.1 
3.5 
5 

6.1 
17 
7,7 
4 

3.7 
7.4 
3.5 
3.3 
3.7 
3.5 
1.2 
7.5 
4 

3.8 
8.2 
6.1 
5.4 
0.8 



8.1 



5.6 
6.9 



4.7 



7,4 
1.3 
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111903 
111951 
112141 
.112193 
112197 
112610 
112971 
112984 
113056 
113449 
113497 
113508 
113531 
113604 
113674 
113841 
113857 
113931 
113936 
113987 
114132 
114156 
114213 
114636 
114760 
114781 
114795 
114901 
115096 
115518 
115646 
115764 
115802 
115994 
116032 
116046 
116274 
116310 
116356 
116429 
116461 
116470 
116578 
116579 
116589 
116786 
117170 
117215 
117280 
117576 
117667 
117881 
118336 
118475 
118493 
118505 
119159 
119307 
119355 
119462 
119771 
119940 
119943 
120407 
120493 
120677 
120867 



NMJM906 

NMJM927 

AW137198 

R49499 

NMJ303655 

AW500106 

Z42387 

T16971 

AF019226 

AW160683 

AF143321 

AL042936 

AK001898 

AI075407 

NM_014214 

W30681 

AW243158 

BE255499 

W17056 

AA345519 

AI342493 

BE179882 

N58309 

AA075488 

AI929382 

T10446 

A8037853 

AV660012 

AI683069 

BE541042 

N36110 

AW582256 

AW410233 

AB037836 

BE383668 

BE395293 

AI129767 

Z24854 

AI371223 

AF191018 

AA313607 

AI272141 

021262 

AW888411 

AI557212 

H25836 

N25929 

N20066 

M18217 

AI383467 

U59305 

AF161470 

BE327311 

N66845 

AL353944 

N67343 

AF142419 

BE048061 

BE218319 

BE041667 

AI905687 

AL050097 

BE565849 

AA235207 

AW968080 

AF150208 

AA350781 



Hs.166351 

Hs.100527 

Hs.278682 

Hs.138238 

Hs.5637 

Hs.23643 

Hs.83883 

Hs.289014 

Hs.8036 

Hs.158006 

Hs.15572 

Hs.211571 

Hs.16740 

HS.2960B3 

Hs.5753 

Hs.146233 

Hs.5297 

Hs.3496 

Hs.83623 

Hs.9641 

Hs.24192 

Hs.336920 

Hs.19575 

Hs.252692 

Hs.95388 

Hs,173484 

Hs.196437 

Hs.175319 

Hs.23240 

Hs.305971 

Hs.91011 

Hs.206521 

Hs.109315 

Hs.42484 

Hs.94491 

Hs.182874 

Hs.42299 

Hs.288671 

Hs.279923 

Hs.58633 

Hs.83484 

Hs.75337 

Hs.81915 

Hs.17132 

Hs.301527 

Hs.42500 

Hs.133207 

Hs.172129 

Hs.44597 

Hs.44708 

Hs.260622 

Hs.47166 

Hs.50115 

Hs.15020 

Hs.37054 

Hs.5807 

Hs.314544 

Hs.2533 

Hs.272531 

Hs.14158 

Hs.250456 

Hs.152939 

Hs.108327 

Hs.96967 



KIAA1072 protein 
KIAA0902 protein 

Phosphatidylgtycerophosphate Synthase 

ESTs 

ESTs 

serine/threonine protein kinase MASK 
transmembrane, prostate androgen induced 
ESTs, Weakly similar to A43932 mucin 2 p 



hypothetical protein 
hypothetical protein IMAGE 109914 
holocytochrome c synthase (cytochrome c 
hypothetical protein FLJ11036 
ESTs, Moderately similar to 154374 gene 
Inositol(myo)-1{or4)-monophosphatase 2 
Homo sapiens cONA: FU22130 fis, done H 
OKF2P564A2416 protein . 
hypothetlca] protein MGC15749 
nuclear receptor subfamily 1, group I, m 
complement component 1, q subcomponent, 



1 
1 

1.4 
1.5 
4.6 
3.3 
3.2 
3.7 
4.5 
1.2 
0.9 
1.1 
1.2 
1.7 
0.8 
1.7 
1.2 
1.5 
3.8 
1.2 



Homo sapiens cDNA FU20767 lis, clone CO 0.3 

glutathione peroxidase 3 (plasma) 1.1 

CGI-11 protein 1.6 

gb:zmB8d01 .s1 Stratagene ovarian cancer ' 1 .6 

hypothetical protein FU20343 1 .4 

ESTs 1 

hypothetical protein FIJ 10337 1 .6 

hypothetical protein FU10788 1 .4 

ESTs 3J 

Homo sapiens cDNA: FU21848 fis, clone H 3.2 

solute carrier family 2 (facilitated glu 1 .5 

anterior gradient 2 (Xenepus laevis) horn 1 .3 

YME1(S.cerevisiae)-(ike1 1.7 

K1AA1415 protein 1.5 

hypothetical protein FU10618 0.9 

hypothetical protein FU20297 1 .6 

guanine nucleotide binding protein (G pr 3.2 

ESTs 0.8 

Homo sapiens cDNA FU 11997 fis, clone HE 2.4 

putative nucleotide binding protein, est 5.5 

Homo sapiens cDNA; FU22145 fis, clone H 5 

SRY (sex determining region Y)-box 4 8.7 

nucleolar and coiled-body phosphprotein 3.2 

leukemia-associated phosphoprotein p18 ( 3.2 

ESTs, Moderately similar to 154374 gene 3.1 

ESTs, Moderately similar to unknown [H.s 3.2 

ADP-ribosylation factor-like 5 7 

PTPRF interacting protein, binding prote 1 .2 

Homo sapiens cDNA:FU21409 fis, done C 4.5 

ESTs 1.4 

Ser-Thr protein kinase related to the my 4.3 

butyrate-tnduced transcript 1 2,1 

HT021 3.6 

gb:za46ri 1 .si Soares fetal fiver spleen 4.2 
Homo sapiens mRNA; cDNA OKFZp761J1 1 12 (f 

gb:yz50b07.s1 Morton Fetal Cochlea Homo 2.1 

homolog of mouse quaking QKI (KH domain 3.7 

ephrfn-A3 3 

GTPaseRab14 1.1 

Homo sapiens cervical cancer suppressor- 1 .4 

EST 3.2 

DKFZP586B0319 protein* 4.3 

coplne III 3.5 

hypothetical protein OKFZp762F201 1 1 .5 
Homo sapiens done 24630 mRNA sequence 4 

damage-specific DNA binding protein 1(1 1 .6 

ESTs 1.1 

180 



5.4 
3.8 
3.5 
3.6 
2 

105 
3 

10.8 

3.7 

4.4 

3.6 

3.5 

3.9 

5.3 

6.1 

6.2 

4.6 

4 

1 

4.7 
4.3 
4.3 
9.2 
3.7 
4 

4.3 
9.2 
5.2 
1 

4.2 
3.9 
5.9 
6.6 
9.1 
4.3 
5.5 
2,4 
4.7 
3.9 
5.5 
1.3 
4.5 
6.9 
3 

8.3 
4.5 
5.5 
6.2 
2.4 
4.2 
0.5 
5.7 
7.7 
0.5 
3.5 
3.8 
1.5 
1.1 
5.6 
4.3 
1 

0.7 
1.9 
3.7 
1.4 
6.8 
3.6 



3.3 
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121368 BE262956 
121603 AA416785 
121723 AA243499 
122223 AF169797 
122378 AB032948 
122946 AI718702 
123155 AF121856 
123158 AF161426 
123327 AA421581 
123495 W28673 
123526 AA608657 
123533 AA608751 
123768 AI932318 
123961 AL050184 

123999 AF084555 

124000 BE563957 
124038 AB037860 
124059 BE387335 
124083 AW195237 
124148 BE300094 
124153 AU077333 
124252 BE613340 
124314 AK001552 
124375 D87454 
124432 N39016 
124447 N48Q00 
124539 054120 
124543 AI393320 
124564 H66409 
124574 AL036596 
124605 AA749315 
124639 H60193 
124659 AI6BQ737 
124737 BE270465 
124760 AW403586 
124763 BE410405 
124792 R44357 
124842 R56485 
124940 AF068846 
124949 A1903210 
124960 AL023513 
124995 T527Q0 
125030 AA610577 
125034 BE548446 
125058 T83731 
125076 AA973971 
125090 T91518 
125103 AA570056 

125154 W38419 

125155 AA837043 
125159 AK000669 
125170 AL020996 
125181 R40815 
125193 W67577 
125260 H05635 
125262 AW884980 
125272 BE612888 
125388 W27235 
125824 Z45258 
125852 AW630088 
125970 AW504721 
126192 AW160399 
126469 BE384361 
126510 AA057593 
127095 AA340277 
127524 AI243596 
128312 J04182 



Hs.178292 K1AA0180 protein 1.5 

Hs.249495 heterogeneous nuclear ribonucleoprotein 2.2 

Hs.104800 hypothetical protein FU10134 3.4 

Hs.27413 adaptor protein containing pH domain, PT 3.9 

Hs.21356 hypothetical protein DKFZp762K2015 1.4 

Hs.308026 major histocompatibility complex, class 1.4 

Hs.284291 sorting nexln 6 1.2 

Hs.218329 hypothetical protein 2.4 

Hs.178443 ESTs 0.9 

Hs.106747 serine carboxypepiidase 1 precursor prot 1 .3 

gb:ae55d04.s1 Stratagene lung carcinoma 2. 1 

gbae56h07.s1 Stratagene lung carcinoma 2.1 
Hs.188762 ESTs, Moderately similar to H2BL_HUMAN H 1.1 

Hs.21610 DKFZP434B203 protein 1.1 

Hs.7351 cyclic AMP phosphoprotein, 19 kD 1.4 

Hs.74861 activated RNA polymerase II transcriptio 1.9 

Hs.173933 nuclear (actor t/A 1.5 

Hs.283713 ESTs, Weakly similar to S64054 hypotheti 14.8 

Hs.7734 hypothetical protein FU22174 1.2 

Hs.227751 lectin, galactoside-blnding, soluble, 1 2.5 

Hs.160483 erythrocyte membrane protein band 7.2 (s 1 

Hs.334725 Homo sapiens, Similar to RIKEN cDNA 9430 1.5 

Hs.215766 GTP-binding protein 1.8 

Hs.192966 K1AA0265 protein 1.1 

Hs.268869 ESTs, Weakly similar to ALUCJWMAN fill 1.3 

gteyy98e12.s1 Soares_multiple_sclerosis_ 2.7 

Hs.146409 cell division cycle 42 {GTP-binding prot 2.1 

Hs.104573 ESTs 1 

Hs.108275 ESTs 1-4 

Hs.42322 A kinase (PRKA) anchor protein 2 0.7 

Hs.77171 minichromosome maintenance deficient (S. 1 .1 

Hs.21143 DKFZP586C1324 protein 1.4 

Hs.289068 Homo sapiens cDNA FU11918 fis. clone HE 1.5 

HsJ8793 protein kinase C.zeta 0.7 
Hs.91052 ESTs, Moderately similar to ALU5_HUMAN A 0.9 

Hs.76288 calpain2 f (m/ll)largesubunit 1.3 

Hs.48712 hypotheflcal protein FU20736 1.8 

gb:yg93h09.s1 Soares infant brain 1N1B H 1 

Hs.1 03804 heterogeneous nuclear ribonucleoprotein 3.2 

Hs.336780 tubulin, beta polypeptide 1 

Hs.194766 seizure related gene 6 (mouse)-flke 0.9 

Hs.110044 ESTs 0.9 
Hs.187775 ESTS 1-2 
Hs.5167 Homo sapiens mRNA; cDNA DKFZp434F152 (fr 

Hs.3343 phosphoglycerats dehydrogenase 0.9 

gbxxj02h08.s1 NCI_CGAP_Lu5 Homo sapiens 

gb:ye20fi)5.s1 Stratagene lung (937210) H 3.2 

Hs.122730 ESTs,ModeratelysimilartoKlAA1215pro 5.3 

gb:zc78a07,s1 Pancreatic Islet Homo sapl 0.9 
Hs.143669 ESTs 1.1 
Hs.274426 TRF2-lnteractlng telomerlc RAP1 protein 11 
Hs.8518 selenoprotein N 1.1 
Hs.12396 ESTs, Weakly similar to 2004399A chromos 1 
Hs.84298 CD74 antigen {invariant polypeptide of m 1.2 
Hs.294030 topoisomerase-related function protein 4 1 
Hs.171957 triple functional domain (PTPRF Interact 1.3 
Hs.180224 myosin regulatory light chain 1.1 
Hs.64311 a disintegrin and metalloproteinase doma 1.4 
Hs.286013 short coiled-coil protein 2.4 
Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 
Hs.177516 high density lipoprotein binding protein 1.9 
Hs.30376 hypothetical protein 1.4 
Hs.182885 ESTs, Weakly similar to JC5024 UDP-galac 2 
Hs.334762 hypothetical protein FLJ14735 1.3 
Hs.10248 Homo sapiens cDNA FU20167 fis, clone CO 1.3 
Hs.94830 ESTs, Moderately similar to T03094 A-kin 4.3 
Hs.150101 lysosomal 15 
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128453 X02761 


Hs.287820 fibronedin 1 


1.2 


4.3 


128460 T16206 


Hs.237164 ESTs, Highly similar to LDHH_HUMAN L-LAC 3.1 


44.4 


128491 H08379 


Hs.165563 hypothetical protein DKFZp434N1429 


0.6 


13.1 


128495 NNL005904 


Hs. 100602 MAD (mothers against decapentapleglc, Or 


1.3 


4 


128546 NM.003478 


Hs.101299 cuHin5 


1 


5.1 


128574 AI185977 


Hs.38260 ubiquitin specific protease 1 8 


0.8 


4 


128611 NMJJ14721 


Hs.102471 K1AA0680 gene product 


1.3 ' 


3.7 


128652 AA432202 


Hs.103147 hypothetical protein FU21347 


1.4 


3.9 


128653 D87432 


Hs.10315 solute carrierfarnfy 7 (calionlc amino 


\2 


3.6 


128655 AI246669 


Hs.324275 WW domain-containing protein 1 


0.8 


4.1 


.126684 BE246444 


Hs.283685 hypothetical protein FU20396 


3 


1.6 


128717 AK001564 


Hs.104222 hypothetical protein FU10702 


2.8 


4.8 


128774 AA476220 


Hs.54457 CD81 antigen (target of antiproSferaOv 


1.1 


10.6 


128790 AF026692 


Hs.105700 secreted frizzied-related protein 4 


1 


3.8 


128805 AA194554 


Hs.1 83434 ATPase. H+ transporting, lysosomal (vacu 


5.3 


5.3 


128827 AI638184 


Hs.1 06334 Homo sapiens done 23836 mRNA sequence 22 


5,3 


128840 AI917602 


Hs.106440 ESTs 


1 


4.5 


128869 AA768242 


Hs.80618 hypothetical protein 


0.8 


3.6 


128889 060985 


Hs.106909 OKFZP566D193 protein 


4.6 


3.7 


128890 AI222020 


Hs.1 82364 CocoaCrisp 


3 


1.5 


128915 AK000140 


Hs.1 07139 hypothetical protein 


0.2 


3.9 


128920 AA622037 


Hs.166468 programmed ceil death 5 


2.5 


15.2 


128926 AF155096 


Hs.107213 hypotheucal protein FLJ205B5 


4 


4 


128930 AA298958 


Hs.10724 MDS023 protein 


1.2 


4.5 


128942 AW247536 


Hs.10729 hypothetical protein 


1.4 


5 


128948 AW953622 


Hs.223025 RAB31, member RAS oncogene family 


2.3 


5.6 


128953 AB020716 


Hs.107362 KIAA0909 protein 


0.9 


3.9 


128979 AW271217 


Hs.281434 Homo sapiens cDNA FLJ14028 fis, clone HE 1.5 


3.6 


128980 AA258924 


Hs.10758 NM_002495*:Homo sapiens NADH dehydrogena 


0.8 


129005 AI770025 


Hs.13323 hypothetical protein FU22059 


1.2 


5.7 


129009 C15105 


Hs^30716 Homo sapiens cDNA FU14368 fis, clone HE 2.1 


9.9 


129013 AA371156 


Hs.107942 DKFZP564M112 protein 


24 


3.8 


129068 AI634522 


Hs.152925 K1AA1268 protein 


1.2 


3.8 


129106 AW504486 


Hs. 108689 sterol regulatory element binding transc 


12 


5.5 


129113 BE543205 


Hs.288771 OKFZP586A0522 protein 


0.5 


3.7 


129125 AB00245O 


Hs.278391 CG1-109 protein 


1 


5.2 


129126 AW881089 


Hs.108806 Homo sapiens mRNA; cDNA DKFZp566M0947 (f 


1.5 


129151 N23018 


tis.171391 C-terminal binding protein 2 


2.1 


9.7 


129230 AA335362 


Hs,1 09646 Empirically selected from AFFX single pr 


0.9 


8.6 


129234 M18916 


Hs.282997 glucosidase, beta; acid (Includes glucos 


1.1 


3.5 


129238 BE542214 


Hs.109697 ESTs 


1.1 


128 


129239 W57656 


Hs.109701 ubiquiMke 5 


3.2 


5.1 


129241 AI878857 


Hs.1 09706 hematological and neurological expressed 


1.9 


5.7 


129243 BE169531 


Hs.109727 TAK1-binding protein 2; KIAA0733 protein 


1.2 


6.6 


129247 R49920 


Hs.109733 CGM31 protein 


1.5 


3.5 


129250 AA344367 


Hs. 109760 Empirically selected from multiple AFFX 


1 


5.4 


129258 AA250970 


Hs.251946 poty(A)-binding protein, cytoplasmic 1-1 


1.3 


4.1 


129260 AF077200 


Hs.279813 hypothetical protein 


1.6 


3.9 


129270 AA357185 


Hs.109918 ras homotog gene family, member H 


1.8 


4.2 


129277 AB007896 


Hs.1 10 putative L-type neutral amino acid trans 


1.1 


6.1 


129284 AA318224 


Hs.296141 ESTs 


25 


4.8 


129300 W94197 


Hs.1 10165 ribosomal protein L26 homolog 


1.6 


5.1 


129318 AF189062 


Hs.285976 tumor metastasis-suppressor 


1.8 


6.5 


129352 AW511656 


Hs.1 70177 Metsl (mouse) homolog 


0.9 


4 


129362 U30246 


Hs.1 10736 solute earner family 12 (sodium/potassl 


1.4 


9.2 


129379 BE278964 


Hs.1 1085 CGI-1 11 protein 


1 


4.8 


129390 AA318271 


Hs.250905 hypothetical protein 


1 


4.1 


129416 AA016188 


Hs.111244 hypothetical protein 


1.8 


10.7 


129427 AI498631 


Hs.1 11334 ferritin, light polypeptide 


1.1 


4.8 


129470 W92931 


Hs.250899 heat shock factor binding protein 1 


1.8 


9.3 


129472 AL050260 


Hs.323817 OKF2P547E1010 protein 


1 


5 


129475 N^004477 


Hs.203772 FSHD region gene 1 


1.1 


4.2 


129498 AA449789 


Hs.7551 1 connective tissue growth factor 


1.9 


6.8 


129501 AI631811 


Hs.180403 STRIN protein 


1.1 


9.7 


129527 AA769221 


Hs.270847 delta-tubuHn 


1.1 


4.3 


129545 R18087 


Hs.323769 cisplatin resistance related protein CRR 


1 


4.2 


129579 AW517695 


Hs.286218 junctional adhesion molecule 1 


2.3 


3.5 



3.8 
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129606 
129619 
129620 
129621 
129634 
129663 
129679 
129688 
129691 
129712 
129747 
129788 
129796 
129797 
129800 
129834 
129836 
129843 
129874 
129878 
129904 
129917 
129976 
129982 
130007 
130060 
130064 
130068 
130090 
130095 
130102 
130112 
130115 
130123 
130150 
130161 
130210 
130213 
130215 
130232 
130252 
130281 
130343 
130385 
130414 
130417 
130440 
130442 
130465 
130479 
130499 
130546 
130568 
130606 
130612 
130616 
130623 
130629 
130632 
130639 
130641 
130653 
130655 
130666 
130669 
130693 
130694 



AW968941 

AA209534 

D79338 

AL1 10212 

AB020335 

AI207406 

AW889132 

U53209 

M26939 

U46386 

AU050272 

BE397454 

BE218319 

M62839 

AF052112 

AL0B0084 

AW410233 

NMJ14840 

AA626937 

Z43161 

AL1 19499 

M30773 

X14008 

Z14221 

R15917 

BE277024 

X57815.comp 

M93143 

H97878 

AK001635 

W61005 

AA916785 

T47294 

NM_0Q5095 

BE094848 

R42678 

M23115 

BE278370 

BE3018B3 

U29463 

U92014 

W78907 

AB040914 

AW067800 

AW842182 

AW163518 

AA852868 

NW.006245 

AW362955 

R44163 

AB007915 

AI598022 

AA232119 

A1652143 

BE242873 

AL049963 

AL045128 

AL042896 

AW073971 
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Homo sapiens mRNA; cONA DKFZp586H0324 (f 

phosphatidyEnositol glycan, class Q 1.1 

hypothetical protein FU12701 1 .2 

Homo sapiens cONA: FU2 1449 fis. clone C 1.4 

reticu!on3 1.2 

hypothetical protein FU22794 1 .8 

cyctinF 1.3 

HMT 1 (hnRNP methyltransferase, S. cerevl 3.2 

K1AA1329 protein 1 

hypothetical protean FU1 1 830 similar to 1.3 
OnaJ (Hsp40) homolog, subfamily A, membe 2.7 

transtocase of outer mitochondrial membr 1 .9 

UDP-N^(^yigiucosammepyrophosphorylas 1.9 
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chromosome 20open reading frame 3 1.9 
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GCN1 (general control of amino-acid synt 2. 1 

DKF2P564F0522 protein 1.1 

K1AA1273 protein \2 

p53-lnduced protein PIGPC1 4.5 

tumor-associated calcium signal transduc 3.4 

ESTs 0.8 

keratin 19 3.3 

ESTs, Moderately similar to 138022 hypot 0.6 

KIAA0997 protein 1.4 

amino acid transporter 2 1 .2 

transcriptional co-activator with PDZ-bi 0.7 

Homo sapiens cONA FU20738 lis, clone HE 2.1 
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WAA1494 protein 

UDP-gruoose dehydrogenase - - - - 
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Homo sapiens cONA FU14175 fis, clone NT 3.2 

hypothetical protein 1.3 

cytochrome c oxidase subunit Vila polype 1 .3 

secreted frtafed-related protein 2 10.6 

peroxiredoxin 5 1.1 

DKF2P586G1722 protein 1.6 

gMtsapiens VII-5 gene for lmmunogfobut 1 .1 
ESTs, Highly similar to IRX1_HUMAN IROQU 5.3 

K1AA0214 gene product 0.6 

deleted In oral cancer (mouse, homolog) 2.4 
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ubiquiSn specific protease 1 0.5 
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131947 AI123939 Hs.182997 ESTs 0.7 

131961 AA129782 Hs.3576 Homo sapiens mRNA full length Insert cON 0.9 

131964 AW381148 Hs.198365 2,3-bisphosphogiycerate mutase 1.1 

131974 AF208856 Hs.268122 hypothetical protein 1.3 

131983 AF119665 Hs.184011 pyrophosphatase (inorganic) 3.3 

131997 AF229181 Hs.136644 CS box-containing WO protein 0.9 

132006 AW162336 Hs.3709 bw molecular mass ubiquinone-binding or 1.2 

132063 BE277910 Hs.3833 3 , i)hosphoadenosine5 , -phosphosulfalesy 3.2 

132055 BE379335 Hs.211594 proteasome{prosome,macropain)26Ssubu A2 

132071 AF217798 Hs.3850 US 1 -interacting protein NUDEL; endoolig 0.7 

132079 AI701457 Hs.38694 ESTs 2 

132094 NMJ>16045 Hs.3945 CGI-107 protein 1.2 

132116 AW960474 Hs.40289 ESTs 3.1 

132164 AI752235 Hs.41270 procollagen-lysine, 2*xoglutarate 5-dio 1.8 

132181 AW961231 Hs.16773 Homo sapiens clone TCCCIA00427 mRNA sequ 

132208 AL031709 Hs.241575 hypothetical protein CAB561 84 1.4 

132258 AA306325 Hs.4311 SUMO-1 activating enzyme subunit 2 2 

132303 BE177330 Hs.325093 Homo sapiens cDNA: FU21210 fis, done C 1.2 

132316 U28831 Hs.44566 KIAA1641 protein 5.9 

132358 NWLQ03542 Hs.46423 H4 histone family, member G 5.8 

132384 AA312135 Hs.46967 HSPC034 protein 2.1 

132397 AA021160 Hs.4750 hypothetical protein DKFZp564K0822 1.3 

132413 AW361383 Hs.260116 metaltopratease 1 (pitrilysin family) 2 

132442 AW970859 Hs.313503 ESTs 1.2 

132534 BE388673 Hs.5086 hypothetical protein MGC1 0433 2 

132540 BE396290 Hs.5097 synaptogyrin2 1.4 

132554 AF065391 Hs.194718 zinc finger protein 265 1.2 

132575 AV660538 Hs.284162 60S ribosomat protein L30 isoiog 3 

132585 AF029750 Hs.179600 TAP binding protein (tapasin) 1.8 

132602 AW606927 Hs.5306 hypothetical protein DKF2p586F1 122 simil 1.6 

132608 AA353044 Hs.5321 ARP3 (actirwelated protein 3, yeast) ho 1.8 

132718 NM.004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribon 4.2 

132719 AI264357 Hs.55405 hypothetical protein MGC 162 12 1.1 
132730 AK000868 Hs.5570 hypofiietical protein FU 10006 1.4 
132765 BE222975 Hs.56205 insulin induced gene 1 1.1 
132782 F07424 Hs.279840 zinc finger protein 222 13 
132793 AB020713 Hs.56966 KIAA0906 protein 2.3 
132805 AW975748 Hs.5724 sclerostin 0.7 
132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 1.8 
132894 D63209 Hs.5944 solute carrier family 11 (proton-coupled 1.5 
132930 AA579258 Hs.6083 Homo sapiens cDNA: FU21028 fis, clone C 1 

132932 AW1 18826 Hs.6093 Homo sapiens cDNA: FU22783 fis, clone K 0.7 

132933 BE263252 Hs.6101 hypothetical protein MGC3178 1.6 
132965 AI248173 Hs.191460 hypothetical protein MGC12936 1 
132984 BE539199 Hs.62112 zinc finger protein 207 1.5 
132990 X77343 Hs.334334 transcription factor AP-2 alpha {activat 13.9 
132998 Y00062 Hs.170121 protein tyrosine phosphatase, receptor t 0.6 
133002 AW499985 Hs.42915 ARP2 (actin-related protein 2, yeast) ho 1.5 

133011 NMJ)06379 Hs.171921 serra domain, immunogtobulin domain (lg), 3.5 

133012 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 1 
133040 AW502761 Hs.30909 KIAA0430 gene product 0.9 
133056 H12028 Hs.6396 jumping translocation breakpoint 1.7 
133063 AI654133 Hs.30212 thyroid receptor interacting protein 15 0.6 
133067 AK000708 Hs.169764 hypothetical protein FU20701 1.2 
133080 AF089816 Hs.6454 chromosome 19 open reading frame 3 1.2 
133110 AA808177 Hs.65228 ESTs 0.9 
133150 AV655783 Hs.661 Empirically selected from AFFX single pr 1.1 
133175 AW955632 Hs.66666 ESTs, Weakly similar to S 1 9560 proline-r 1.5 
133199 AF231981 Hs.250175 homotog of yeast long chain polyunsatura 5.5 
133203 AA464362 Hs.6748 hypothetical protein PP 1665 1-2 
133206 AB037773 Hs.6762 hypothetical protein 1.6 
133221 W32474 Hs.301746 RAP2A, member of RAS oncogene family 2.4 
133229 AL137480 Hs.6834 KIAA1014 protein 1 
133241 AW796524 Hs.68644 Homo sapiens microsomal signal peptidase 1.3 
133257 8E617892 Hs,6895 actin related protein 2/3 complex, subun 1.4 
133271 Z48633 Hs.283742 risapiensrnRNAforretrotransposon 3.1 
133273 N27672 Hs.69469 dendritic cell protein 2.5 
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B-factor. properdin 1.3 4 

NRAS-reiatedgene 1.4 5 

peptidylprolyl isomerase 8 (cydophOin 2.2 6.8 

zinc finger protein 238 1.5 4.3 

hypothetical protein MGC10753 1.4 6.3 

apoGpoprotein AA\ 02 3.6 

H factor 1 (complement) 0.6 5 

acid duster protein 33 1.2 42 

DKFZP564I1922 protein 3.7 5.8 

KIAA0447 gene product 1.4 5.1 

hexokinase2 0.9 6.3 

KIAA1235 protein 1.2 3.7 

Homo sapiens mRNA; cDNA DKFZp434G227 (fr 0.7 4.8 

immunoglobulin lambda-like polypeptide 1 1.1 4.3 

voltage-dependent anion channel 3 0.7 4.2 

hypothetical protein MGC2714 3.1 5.9 

desmoptakjn(OPI l DPII) 4.3 11.5 

hypothetical protein MGC14353 1.8 19.7 

K1AA0263 gene product 1.2 5.4 

Empirically selected from AFFX single pr 1 .4 3.9 

synapsin I 0.8 4.9 

RNA-binding protein S1 , serine-rich doma 2 1 0.8 

protease, serine, 1 1 (IGF binding) 2. 1 4.5 

weel + (S. pombe) homolog 3.3 1 .1 

ADP-ribosylation factor-like 6 interact) 2.3 5.6 

dihydropyrimldinase-like 2 0.8 13.5 

popeye protein 3 1 9.1 

quinokJ dihydropteridine reductase 0.5 5.8 

mitogen-activated protein kinase 6 1.1 6.9 

zinc Anger protein 146 1.8 3.8 

tyrosine 3-monooxygenase/tryptophan 5-mo 1.5 11.1 

heterogeneous nuclear ribonucleoprotein 2 3.9 

synaptophysin 0.6 3.5 

KIAA0174 gene product 1.2 7.2 

CREBBP/EP300 Inhibitory protein 1 1.5 5 

zyxin 1.2 4.8 

cadherin 1 1, type 2, OB-cadherin (osteob 3.2 4.1 

ADP-ribosyltransferase(NAD+;poly(ADP- 2.1 3.8 

OKFZP564B167 protein 1.9 12.6 

thymosin, beta 10 2.6 6.6 

G protein-coupled receptor kinase 6 1 4.9 

step II splicing factor SLU7 0.5 3.8 

H1 histone family, member 2 1 .5 4.5 

Homo sapiens mRNA; cDNA OKFZp564B1264 (f 3.7 5.6 

carboxyiesterase 1 (nranocyte/macrophage 0.3 4.4 

ESTs 5.5 2.9 

synuclein, alpha (nonA4 component of am 0.6 4.8 

protein kinase, cAMP-dependent catafyti 1 10.2 

arginyitransferase 1 0.9 4.8 

calumenin 2.8 10.5 

ESTs 1.8 5.6 

proteasome(prosome l macropain)26Ssubu 1.5 6.6 

nuclear factor of activated T-ceHs. cyt 1 .5 3.8 

v-Kkas2 Wrsten rat sarcoma 2 viral cm 0.9 4.3 

von HippeJ-Undau syndrome 2.3 4.3 

SWI/SNF related, matrix associated, acti 3.3 3.4 

Homo sapiens mRNA; cDNA DKFZp564C1216 (f 1.3 5.7 

hypothetical protein hCLA-lso 1 6.5 

ubiquitin specific protease 7 (herpes vi 1.7 3.6 

lysosomal 1 7.5 

matrin3 1.2 4 

cycfinG2 2.7 4.8 

mitochondrial ribosomai protein L3 3.3 2.1 

K1AA0009 gene product 1-3 3.5 

programmed cell death 6 1.7 6.9 

plm-2 oncogene 0.8 5.3 

1.4 11.4 

186 



# 



WO 02/059377 



PCT/US02/02242 



34277 NWL004369 Hs.80988 collagen, type VI, alpha 3 

134280 NM_000712 Hs.81029 bilwerdin reductase A 

I34288 A1022650 Hs.8117 erbb2-lnterading protein ERBIN 

134296 R00603 Hs.8128 phosphatidylserine decarboxylase 

I34300 NNUW1430 Hs.8136 endothelial PAS domain protein 1 

134310 AL037800 Hs.8148 selenoprotein T 

I34343 D50683 Hs.82028 transforming growth factor, beta recepto 

I34364 X76534 Hs.82226 glycoprotein (transmembrane) nmb 

134374 N22687 Hs.8236 ESTs 

134378 AL035786 Hs.82425 actin related protein 2/3 complex, subun 

134382 BE512856 Hs.109051 SH3 domain binding glutamic ackJ-rich pr 

34415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, m 

134417 NM 006416 Hs.82921 solute carrier family 35 (CMP-siaGc aci 

134421 AU077196 Hs.82985 collagen, type V, alpha 2 

I34439 Z23024 Hs.138860 Rho GTPase activating protein 1 

I34454 NM.013230 Hs.286124 C024 antigen (small cell lung carcinoma 

I34494 D86981 Hs.84084 amyloid beta precursor protein (cytoplas 

134501 W84869 Hs.211568 eukaryotic translation initiation factor 

134505 AW960673 Hs.177530 ATP synthase, H+ transporting, mitochond 

134520 BE091005 Hs.74861 activated RNA polymerase II transcriptio 

134528 M23161 Hs.84775 Human transposon-Bke element mRNA 

I34545 AI902899 Hs.85155 butyrate response factor 1 (EGF-response 

I34553 AI203545 Hs.296169 S-phase response {cydin-related) 

I34573 NMJJ16142 Hs.279617 steroid dehydrogenase homolog 

I34576 AB033017 Hs,8594 K1AA1 191 protein 

34577 BE244323 Hs.8595t exportin, tRNA (nuclear export receptor 

I34579 AW936928 Hs.85963 DKFZP564M1 82 protein 

134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 

134600 AF078859 Hs.86347 hypometical protein 

134655 AF265208 Hs.123090 SWI/SNF related, matrix associated, acfi 

I34700 AK000606 Hs,8868 golgi SNAP receptor complax member 1 

I34737 D17530 Hs.89434 drebrinl 

134762 T51986 Hs.283108 hemogtobin, gamma G 

I34843 AA428520 Hs.90061 progesterone binding protein 

I34854 J03464 Hs.179573 coHagen. type I, alpha 2 

I34865 AA587775 Hs.66295 mulli-POZ-domaln-containing protein 

I34868 AB020689 Hs.9Q419 WAA0882 protein 

I34874 A1803761 Hs.90458 serine palmitoyltransferase, long chain 

1 34885 AJ002030 Hs.9071 progesterone membrane binding protein 

134891 R51083 Hs.90787 ESTs 

1 34908 8E089782 Hs.9877 hypothetical protein 

I34934 AF005043 Hs.91390 poly (ADP-ribose) glycohydrolase 

[34970 BE560779 Hs.284233 NICE-5 protein 

134982 AK002085 Hs.92308 Homo sapiens cDNA FU1 1223 fls, clone P 

135011 AB037835 Hs.92991 KIAA141 4 protein 

I35032 AW301984 Hs.173685 hypothetical protein FU12619 

I35035 AL034344 Hs.284186 forkhead box CI 

135051 AI272141 Hs.83484 SRY {sex determining region Y)-box 4 

I35060 AK001887 Hs.259842 protein kinase, AMP-activated, gamma 2 n 

I35062 AK000967 Hs.93872 KIAA1682 protein 

I35077 AW503733 Hs.9414 KIAA1 488 protein 

135082 AB017363 Hs.94234 frizzled (Drosophila) homolog 1 

135107 T97257 Hs.337531 ESTs, Moderately similar to 138022 hypot 

135143 AA132813 Hs.69559 KIAA1 096 protein 

135156 BE563088 Hs.9552 binder of Arl Two 

135172 AB028956 Hs.12144 KIAA1033 protein 

135181 BE250865 Hs.279529 px19-like protein 

135222 AA534009 Hs.183487 interferon stimulated gene (20kD) 

I35232 AL038812 Hs.96800 ESTs, Moderately similar to ALU7_HUMAN 

135289 AW372569 Hs.9788 hypothetical protein MGC10924 similar to 

135290 AA331901 Hs.184736 hypothetical protein FU 10097 

135291 T83882 Hs.97927 ESTs 
I35349 AA114212 Hs.9930 serine (or cysteine) proteinase Inhibito 
135357 AI565004 Hs.79572 cathepsln O (lysosomal aspartyl protease 
I35398 M16029 Hs.287270 ret proto-oncogene (multiple endocrine n 
J35399 W79431 Hs.326249 ribosomal protein L22 
I35400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 
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15 
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302665 R99693 


Hs.224410 


Homo sapiens cONA FU12843 Gs, done NT 3.6 


3.6 


302892 AW1 76909 


Hs.42346 


caldneurei -binding protein caisarcin-1 


3.3 


1.6 


302963 AW673106 


Hs.151945 


mitochondrial ribosomal protein L43 


0.9 


4.2 


303131 AW081061 


Hs.103180 


DC2 protein 


3 


17.3 


303150 M887146 


Hs.8217 


stromal antigen 2 


6.2 


4 


310125 AA147979 


Hs.285005 


mitochondria! import receptor Tom22 


1.2 


6.6 


312662 AA233808 


Hs.286241 


protein kinase, cAMP-dependent, regulate 


1 


3.5 


319429 BE616412 


Hs.286218 


junctional adhesion molecule 1 


1.5 


4.7 


320591 AA054761 


Hs.169149 


karyopherin alpha 1 (importin alpha 5) 


\2 


^ 5.6 


406779 AA412048 


Hs.279574 


CGI-39 protein: cell death-regulatory pr 


1.3 


3.5 


410691 AW239226 


Hs.65450 


reticuton4 


1.2 


13.9 


410763 AF279145 


Hs.8966 


hypothetical protein RJ21776 


2 


5.1 


415738 BE539367 


Hs.295953 


ESTs, Weakly similar to AF220O49 1 uncha 


1.3 


3.9 


420186 NM 015925 


Hs.95697 


Over-specific bHLH-Zip transcription fa 


1.5 


6.2 


422055 NM_014320 


Hs.1 11029 


putative heme-binding protein 


2 


11.3 


425815 R94023 


Hs.337531 


ESTs. Moderately similar to 138022 hypot 


1.7 


3.6 


425218 AF1 19043 


Hs.168005 


Homo sapiens cDNA FU13372 fis, clone PL 3.3 


2.8 


427397 AI929685 


Hs 177656 

1 Iw. Iff \j\J\J 


calmodufin 1 (phosphorylase kinase, delt 


1.3 


4.7 


427466 AA523543 


Hs.7678 


cellular refinoic acid-binding protein 1 


1.1 


3.7 


427505 AA361562 


Hs.1 78761 


26S proteasome-assodated pad1 homotog 


3.2 


2.5 


427723 AI355260 


Hs!279789 


his tone deacetylase 3 


2.8 


22 


428673 AW601325 


Hs.324278 


Homo sapiens mRNA; cDNA OKFZp566M063 (fr 


1.1 


430219 X99209 


Hs.235887 


HMT1 (hnRNP methyltransferase, S. cerevi 


1.8 


8.8 


430450 R23553 


Hs.241489 


hypothetical protein 


1.1 


5.6 


432866 BE395875 


Hs.279609 


mitochondrial carrier homolog 2 


1.5 


6.1 


433423 BE407127 


Hs.8997 


heat shock 70kD protein 1 A 


1.3 


7.6 


437562 AB001636 


Hs.5683 


DEAD/H (Asp-Olu-Ala-Asp/His) box polypep 


1.6 


6.5 


437667 BE616412 


Hs.286218 


junctional adhesion molecule 1 


1.3 


3.5 


437754 R60366 


Hs.5822 


Homo sapiens cDNA: FU22120 lis, done H 


2 


5.7 


440252 BE513940 


Hs.6101 


hypothetical protein MGC3178 


1.1 


6.2 


441471 ALD42986 


Hs.7857 


erythrocyte membrane protein band 


0.5 


3.7 


448292 BE281316 


Hs.47334 


hypothetical protein FU14495 


2.5 


4.9 


449404 H51066 


Hs.23581 


leptin receptor gene-related protein 


1.1 


3.6 


449964 AW001741 


Hs.273193 


hypothetical protein FLJ10706 


1.4 


3.5 


451389 N73222 


Hs.279009 


matrix Gla protein 


4 


11.2 


452685 AI634651 


Hs.30250 


v-maf musculoaponeurotic fibrosarcoma (a 


0.8 


5.6 


RC_H15847j> 




peptidylprolyl bomerase B (cydophiQn B) 


1.8 


4.8 


RCJV84712 




calumenln 


3.5 


4.6 


X14008_ma1J 


lysozyme (renal amyloidosis) 


0.9 


4.5 


RC..H86543J 




ESTs 


1.8 


6.6 


H07011 




ESTs; Weakly similar to SAS [H.sapiensl 


1.8 


3.9 


R<LAA164586_s 




ESTs 


6.2 


RC.AA070485 


Homo sapiens done 23967 


3.4 


2.6 


RC_H98714j> 




ESTs 


1.6 


3.5 


RCJVA406145J 




ESTs 


4.6 


AA458584 




SRY (sex determining region Y>box 4 


3.4 


0.4 


AA031548 




cell division cycle 42 (GTP-blnding protein; 25kD) 


3.1 


X02761 




fibronectin 1 


3.6 


15.2 


RC.AA487193 


secreted frtezled-related protein 4 


4.7 


4 


R25326 




Homo sapiens mRNA for putative vacuolar 


0.9 


5 


RCJVA393805 


ESTs; Weakly similar to (defline not 


1.1 


8.4 


RC.M449333 


ESTs 


2.9 


4.6 


RC_M287681_S 




ESTs 


1.3 


RC.AA490864 


ESTs; Highly similar to heat shock factor 


1.4 


5 


RC.C14243J 




ESTs; Highly similar to heat shock factor 


1.7 


5 


R21443 




ESTs 


1.6 


3.7 


RCJVA251902 


Homo sapiens lysophosphofipase (LPL1) 


2.2 


3.8 


M21121.S 




smaD inducible cytokine A5 (RANTES) 


0.9 


9.9 


C00038.S 




ESTs 


2.8 


4.8 


Y00503 




keratin 19 


3.1 


1.1 


RC_R27006J 




ESTs 


1.6 


3.7 


RCLAA416886 


EST s; Weakly simflar to predicted using 


3.1 


3.1 


RCJW60450 


fibroblast growth factor receptor 2 (bacteria- 


1.5 


3.7 


RC_AA488433 


ESTs; Weakly simflar to deduced amino acid 1.1 


4 


RCLAA27840GJ 




Homo sapiens Hi 


U28831 




Human protein Immune-reactive with anti-PTH4.4 


0.6 


RC.AA199588 


Homo sapiens actin-related protein Arp3 (ARP3) 


1.8 



5.2 



0.8 



3.9 



1.5 



3.6 



4.7 
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AF006082 


Homo sapiens actirwelated protein Arp2 (ARP2) 


1.6 


10.9 


RC_H90899 


desmopfakln(DPI;DPII) 5.4 


5.5 




RCJV95070 


desmopbkin(DPI;DPI!) 5 


2.6 


3.9 


RCJ90946J 


Human mRNA for KIAA263 gene; complete cds 


1.1 


D87258 


protease; serine; 1 1 (IGF binding) 2.4 


3.5 


5.3 


AA313414_S 


ESTs; Weakly similar to cDNA EST EMBUT1157 


1.5 


RC_H73484_j> 


ESTs; Weakly similar to similar to Yeast 1.3 


6.3 


13.5 


AFFX-HUMISGF3A/M97935J3 


2.3 


AFFX-HUMRGE/M10098..5 1.1 


7.9 


7.4 


AFFX-M2783Q_5 




0.5 


AFFX-M27830.5 




0.6 


' 5.4 


RC.AA063431J 


ESTs 


0.8 


4.1 


RC T63769J 


ferritin; light polypeptide 1.1 


3.7 
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TABLE 8A 

Table 8A shows the accession numbers for those pkeys lacking unigenelD's for Table 8. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

108469 116761J AA079487 M1 28547 AA128291 AA079587 AA079600 
125076 190299J AA973971 T88817 AA253263 

114636 109698 1 AA075488 M1 29081 AA074851 AA082852AA074732AA084908AA084751 AA076042 AA131172 AA085374 AA079519 
AA074510 AA1 13824 AA102437 AA070833 AA070143 AA084693 AA084389 AA076373 AA075492 AA062834 AA084335 
AA078829 AA079344 AA069916 AA079275 AA070914 AA654069 AA081976 AA080957 AA0831 15 AA070942 AA085296 

123526 genbanlLAA608657 AA608657 

123533 genbanR_AA608751 AA608751 

125090 genbanKJ91518 T91518 

125154 genbank_W38419 W38419 

118475 genbankJJ66845 N66845 

118505 genbank_N67343 N67343 

101046 entre^K01160K01160 

129982 221J67 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 AA327072 214166 214167 214165 

AW403806 Z14200 AA383972 214205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 247277 247276 X65888 247275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF062177 AF062222 AF0621 15 Z47240 AF062263 AF062261 AF062223 AF06221 1 247238 AW401714 AW404008 AW404991 
AF062280 M99595 247214 Z47232 247218 M26995 AF062184 X65895 L38433 X81731 211946 247226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 XB1748 AF174019 AF062285 L06924 L1 1699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 133035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U0G570 A1268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98B99 AF174058 X63080 
D83676 AB021529 218318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF1 15117 Z92896 U21254 292895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF115109 AFO04324 AF015130 AF090414 AF090418 AB021536 AB021519 
Z96957 AF021986 292898 296956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 296955 Z96954 233903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 298717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW4Q3690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF1031 15 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF058670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW3621 53 AW403803 AW406702 
AW351514D78345 T2914OJ00231 NWL002179AW405146AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 M715266 AW404328 A1831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 M715498 AA295885 T27613 T98113 

108470 genbankJ\A079500 AA079500 
101447 entrei_M21305 M21305 
124447 genbanK_N48000 N48000 
101624 entre?_M55998 M55998 

131791 221 260 X62111 S67984 AJ131056 247243 247235 AF062268 Z47237 AJ131058 AF062130 AF062282 X62108 AA385989 AA464794 
X69861 AW402964 M90808 298735 298734 298736 AF035018 X79161 U00545 AF1 74046 AF 174071 U0O552 U96288 
AF021989 AF062255 AF174061 S73953 AF062135 AF062155 X64147 U00555 AF174070 AF129754 U86789 298714 298738 
S75886 AJ008175 AJ008186 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
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H45128 R86072 AA327565 AI660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW364072 H45561 H45556 H42605 AI735017 T47421 R48719 H27570 
H44599 AI459598 H42347 H41938 H24993 AA345888 H22339 AI538691 AJ012264 AA664201 A1880450 AA327310 AI991250 
AI833028 AW001210 AI956075 H30467 AA326915 H41943 AI749266 A1744441 AA327377 AW512326 AI735170 H01634 
5 AI587047 AI571623 AA327486 M327103 AA327195 AA326973 T28143 

124842 217726J R56485 R37248 R59992 

103758 AA084874_Lat AA084874J 

130064 221 J64 X57815 L29157 AA367448 AW328098 AW404536 L29156 L29163 H27683 R83195 AA295096 AW327822 H25458 AW404692 
X57819 X57823 AW405604 AW404447Z34914 AW406542 AA427726 AA604389 AW405606 AW405918 AW4051 17 

1 0 AW407182 L03632 AW405058 L03627 AW407470 R72738 L21959 AW375738 X87888 AF124166 AF054638 AJ241406 

U09902 AW405284 M12025 AA360219 L03631 L03629 AW327727 AF194810 AF058076 AF194886 AF063774 AR363737 
AF063755 AF063740 AF063739 AF063738 S73129 U3088 AF063775 L33026 AF060138 AF194813 AF194812 AF19481 1 
AF194809 AF194814 AF194808 AF064505 AF064503 AF063765 AF063757 AF063752 AF063716 AF063736 AF194806 
AF058077 AF063747 AF063772 AF063781 AF060137 AF194805 AF060134 AF060132 AF058074 AF063754 AF063704 

1 5 U38589 218332 AF060122 AF194807 AF060135 AF064506 AF084504 AF063773 L26540 AF058072 AF060131 AF0645O0 

AF060136 AF064513 AF194683 AF194711 AF058075 AF063717 Z19546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916AJ241405AF194632AJ388659 AF194625 Z74694AF1 94588 AF194601 U09901 U09911 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ243109 AF194609 
S80758 AF194595 AF194596 L22483 Z70262 084141 AW405758 AF001788 AF194580 D84143 U76684 AF194593 L03630 

20 X87892 X91 134 U21249 Z46346 AJ132426 AF103659 AJ233718 AF021038 AJ233727 H24657 U09882 S75627 AA573599 

AF047231 AF047232 AF103616 AW404484 Z46848 S76132 AF103663 AF103713 Y17940 AF047216 AF103595 Y17956 
AF052799 AF052797 AF052802 AF052798 AF052801 AF052794 AF052796 AF047218 AF052800 AF047217 AF052795 
AF032351 AF103701 AF103708 AF103710 AF103706 S62232 AF103645 AF103632 AF103647 AF103644 AF103640 
AF103692 AF093581 AF103620 AW405934 AM45389 AW383753 AA360256 AF<m676 H21654 H39501 A1820828 H53689 

25 W26785 AW384496 AW407708 AA541663 AA91 1602 A1821461 AA588300 AA327050 H42717 AI951280 AA421322 A1923193 

AA864302 H25133 D87023 J0301 1 M61771 D87017 AA526865 AA253450 
130232 18831J U29463 U06715 W24970 AA584303 T39581 AA155603 AA305043 AA429426 W05664 AA102382 M482044 W24487 AA319060 
T88946 F10106 M232161 AA243117 AA158937 AA100864 
109097 genbanRJWI67512 AA167512 
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TABLE 9: Figure 9 from BRCA 001-2 US 

5 Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 



15 Pkey ExAccn UnigenelD UnlgeneTitle 



20 



25 



30 



35 



100690 
102211 
103587 
104115 
105038 
105500 
105990 
106155 
106373 
106414 
110009 
111900 
114540 
116470 
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TABLE 10: Figure 10 from BRCA 001-3 PCT 

5 Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 ; Ratio of tumor to normal body tissue 

R2: Ratio of SO" percentile tumor to body 

15 R3: Ratio of 75* percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 
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25 100154 H60720 Hs.81892 
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30 100219 AW972300 Hs.118110 
100234 D29677 Hs.3085 
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1.4 


1296 


394 


2.2 


240 


64 


3.2 


101 


13 


7.9 


33 


1 


2.6 
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15 



20 



25 



135117 W52493 


Hs.94694 


Homo sapiens cDNA FLJ 10561 fe, done NT 


5.3 


DO 


4 
1 


A i 




135144 NM_016255Hs.95260 


Autosomal Highly Conserved Protein 


7.4 


74 


5 


2.4 




135154 AK001B35 


Hs.267812 


sorting nexin 4 


6.6 


69 


11 


6.3 




135155 AI207958 


Hs.166556 


Homo sapiens, Similar to TEA domain rami 


6.1 


61 


1 


5.1 




135172 AB028956 


Hs. 12144 


KIAA1033 protein 


3.4 


88 


26 


1.4 




135242 AI583187 


Hs.9700 


cydinEI 


3.1 


31 


1 


2.3 




135243 BE463721 


Hs.97101 


putative G protein-coupled receptor 


3.4 


169 


50 


9.1 




135269 NNL003403HS.97496 


YY1 transcription factor 


3.4 


m □ 








135356 BE312948 


Hs.18104 


hypothetical protein FU11274 


3.1 


31 


10 


1.7 




135357 AI565004 


Hs79572 


calhepsin 0 (lysosomal aspartyl protease 


4.7 


710 


151 


25 




135389 U05237 


Hs.99872 


fetal Alzheimer antigen 


20.6 


206 


4 


19.1 




135397 L14922 


Hs.166563 


replication factor C (activator 1) 1 (14 


3.2 


32 


1 


2.4 




-135400 X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


3.2 


117 


37 


9.4 




AI471525 


Hs.247486 


ESTs 


3.8 


58 


16 


5.5 




X70683 


Hs.93668 


ESTs 


1.8 


1047 


596 


1.6 




L14922 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 


5 


285 


58 


1.2 




M23263 


Hs.904 


amylo-1;6-glucosidase; 4-alpha-gtucanotransferas 


3.1 


31 


1 


2.6 


AI267866 


Hs.148027 


polymerase (RNA) II (ONA directed) polypeptide B 


7.8 


137 


18 


11.9 


AA044640 


Hs.241676 


stromal cell-derived factor 1 


4.7 


114 


25 


0.9 




N90960 


Hs.227459 


ESTs; Moderately similar to HII ALU SUBFAMILY 


4.7 


151 


32 


9.3 


AA8732B5 Hs.137947 


ESTs 


4.7 


47 


3 


4.4 




T56679 


Hs.865 


RAP1A; member of RAS oncogene family 


4 


40 


1 


3.4 




AA305536 




"EST1 76522 Colon carcinoma (Caco-2) cell fine II 


3.6 


121 


34 


11.8 


AI369384 




arylsuffatase 0 


3.5 


113 


33 


1.7 




AA219081 


Hs.242396 


ESTs; Moderately similar to l!!l ALU SUBFAMILY] 


3.4 


107 


32 


9.9 
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TABLE 10A 

Table 10 A shows the accession numbers for those pkeys lacking unigenelD's for Table 10. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 


CAT number 


Accession 


123619 


371681J 


AA602964AA609200 


104602 


524482J 


H47610R86920 


121581 


283769J 


AA416568 AA442889 AA417233 AA442223 


123523 


genbankJ\A608588 AA608588 


100821 


tigr_HT4306 


M26460UQ9116 


125091 


genbanKJT91518 


T91518 


125150 


NOT_FOUND_entre^W38240 W38240 


118475 


genbanK_N66845 


N66845 


104787 


genbank_AA027317 


AA027317 


106055 


genbankJA417034 


AA417034 


113702 


genbankJ37307 


T97307 


101046 


entre?_K01160 


K01160 


101447 


entrezLM21305 


M21305 


101624 


entre^M55998 


M55998 


124677 


genbank_RQ1073 


R01073 


110581 


genbanm61560 


H61560 


119023 


genbankJ498488 


N98488 


110775 


genbanUl22414 


N22414 


112092 


genbankj*44538 


R44538 


112253 


genbank_R51818 


R51818 


107014 


genbank_AA598820 


AA598820 


114988 


genbanKJ^SIOag 


AA251089 
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TABLE 11: Figure 11 from BRCA 001-3 PCT 

5 Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unigene number 

Unigene Titie: Unigene gene title 

R 1 ; Ratio of tumor to norma) body tissue 

R2: Ratio of 90* percentile tumor to normal body 

1 5 R3: Ratio of 75* percentile normal body to tumor 

R4: Ratio of tumor to normal breast tissue 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey ExAccn UnigenelD Unigene Title 


R1 


R2 


R3 


R4 


100131 012485 Hs. 11951 ectonudeotidepyrophosphatase/phosphodi 


13.2 


244 


19 


9.9 


100147 D13666 Hs. 136348 osteoblast specific factor 2 (fasciclin 


15.7 


1030 66 


5 


100522 X51501 Hs.99949 proiactin-induced protein 


22J 


760 


34 


1.4 


100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 


8.5 


85 


1 


3.2 


101104 AW862258 Hs.169266 neuropeptide Y receptor Y1 


15.3 


153 


1 


14.1 


101478 NM.002890Hs.758 RAS p21 protein activator (GTPase activa 


9.6 


96 


1 


8.5 


101724 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 


9.4 


94 


1 


0.3 


101754 S70114 Hs.239489 TIA1 cytotoxic granule-associated RNA-bi 


8.9 


89 


5 


8 


101888 AL049610 Hs.95243 transcription elongation factor A (Sll)- 


7.3 


73 


1 


5.3 


102165 BE313280 Hs.159627 death associated protein 3 


9.3 


93 


5 


8 


102304 AF015224 Hs.46452 mammaglobin 1 


8.5 


2058 243 


1.4 


102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 


6,4 


428 


67 


2.3 


102457 NM_001394Hs.2359 dual spedGdty phosphatase 4 


20.2 


202 


5 


1.3 


102567 U63830 Hs.146847 TRAF family member-associated NFKB activ 


8.2 


82 


1 


6.8 


102823 D85390 Hs.5057 carboxypeptidaseD 


5.6 


56 


1 


5.3 


103557 AL133415 Hs.297753 vimentin 


7.5 


136 


18 


3.4 


103613 NM_000346Hs.2316 SRY (sex determining region Y)-box 9 (ca 


7.3 


73 


1 


5.2 


104115 AF183810 Hs.26102 opposite strand to trlchominophalangeal 


29 


290 


1 


26.8 


104667 AI239923 Hs.30098 ESTs 


14.9 


149 


1 


6.4 


104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP (H.sapien 


7.7 


77 


1 


5.1 


104807 A1139058 Hs.125790 teudne-rich repeat-containmg 2 


7 


70 


1 


6.5 


104896 AW015318 Hs.23165 ESTs 


7.4 


74 


1 


6 


104943 AF072873 Hs.1 14218 frizzled (Drosophlla) homolog 6 


16.2 


162 


1 


4.2 


105038 AW503733 Hs.9414 K1AA1488 protein 


5.5 


55 


1 


5.2 


105329 AA234561 Hs.22862 ESTs 


2.8 


131 


47 


3,9 


105500 AW602166 Hs.222399 CEGP1 protein 


25.4 


508 


20 


3 


105516 AK001269 Hs.30738 hypofteticaJ protein FU10407 


8.3 


83 


3 


1.8 


105730 AW377314 Hs.5364 DKFZP564 1052 protein 


6.9 


69 


1 


4.4 


106012 AI240665 Hs.8895 ESTs 


21,2 


212 


6 


17.4 


106095 AF115402 Hs.1 1713 E74-Gke factor 5 (ets domain transcript 


26.3 


356 


14 


1 


106155 AA425414 Hs.33287 nucfear factor IB 


9.9 


483 


49 


1.8 


107102 AB037765 Hs.30652 K1AA1 344 protein 


6.3 


63 


1 


5.4 


107136 AV661958 Hs.8207 GK001 protein 


2.5 


392 


155 


4.3 


107151 AW378065 Hs.8687 ESTs 


15.6 


156 


7 


10.8 


107922 BE153855 Hs.61460 Ig superfamBy receptor LNiR 


9 


90 


1 


5,5 


108339 AW151340 Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALUS 


18.7 


187 


1 


17 


109112 AW419196 Hs.257924 hypothetical protein FU 13782 


4.1 


334 


82 


3.4 


109292 AW975746 Hs.188662 KIAA1 702 protein 


7.1 


71 


1 


6.5 


109415 U80736 Hs.110826 tnnudeotkJe repeat containing 9 


12.3 


123 


1 


11-3 


109912 AW390822 Hs.301528 L^urenine/alpha^amtodipateaminotra 


14.2 


142 


1 


9.5 


110009 8E075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p 


6.3 


693 


110 


7.2 


110915 BE092285 Hs.29724 hypothetical protein FU13187 


20.9 


209 


1 


19.5 


111164 N46180 Hs.122489 Homo sapiens cONAFLJ13289fis. done OV 


7.7 


77 


1 


5 
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111179 AK000136 Hs.10760 asporin (LRR class 1) 
111190 AK002055 Hs.151046 hypothetical protein aJ1 1193 
111223 AA852773 Hs.334838 KIAA1866 protein 
111357 BE314949 Hs.87126 hypothetical protein RJ23309 
112244 AB029000 Hs.70823 KIAA1077 protein 
113047 A1571940 Hs.7549 ESTs 

1 13702 T97307 gb:ye53h05.s1 Scares fetal liver spleen 

114124 W57554 Hs.125Q19 lymphoid nuclear protein (LAF-4) mRNA 
114138 AW384793 Hs.15740 Homo sapiens mRNA; cDNA DKFZp434E033 (fr 
114768 AF212848 Hs.182339 ets homologous factor 
114860 AL157545 Hs.42179 bromodomaln and PHD finger containing, 3 
114965 AI733881 Hs.72472 BMP-R1B 

1 14988 AA251089 gb:zs04f05.s1 NCI_CGAP_GC81 Homo sapiens 

115206 AW183695 Hs.186572 ESTs 

1 15719 AW992405 Hs.59622 Homo sapiens, done IMAG&35Q7281, mRNA, 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 

116470 AI272141 Hs.83484 SRY {sex determining region Y}-box 4 

116786 H25836 Hs.301527 ESTs, Moderately similar to unknown \Hs 

117280 M18217 Hs.172129 Homo sapiens cDNA: FU21409 fis, clone C 

117412 N32536 Hs.42645 solute carrier family 16 (monocarboxyiic 

118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 

119271 AI061118 Hs.65328 Fanconl anemia, complementation group F 

119771 AI905687 Hs.2533 EST 

120562 BE244580 Hs.302267 hypothetical protein FU 10330 

121463 AK000282 Hs.239681 hypothetical protein FU20275 

121723 AA243499 Hs.104800 hypothetical protein FU10134 

122963 AA478446 Hs.69559 KIAA1096 protein 

123137 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 

123619 AA602964 gb;no97cQ2.s1 NCLCGAP_Pr2 Homo sapiens 

123709 AA706910 Hs.1 12742 ESTs 

124006 AI147155 Hs.270016 ESTs 

124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 
1243QS AA249027 Hs.241507 ribosoma! protein S6 
125279 AW401809 Hs.4779 K1AA1 150 protein 
125617 AA287921 Hs.164950 ESTs 

127439 D60237 Hs.14368 SH3 domain binding glutamic acid-rich pr 

128305 AI954968 Hs.279Q09 matrix Gla protein 

128482 AI694143 Hs.296251 programmed cell death 4 

128790 AF026692 Hs.105700 secreted mzzled-related protein 4 

128925 R67419 Hs.21851 Homo sapiens cONA RJ 12900 fis, clone NT 

129017 AA1 15333 Hs.107968 ESTs 

129229 AF013758 Hs.1 09643 polyadenylate binding protein-interactin 

129337 NM_014918Hs.110488 KIAA0990 protein 

129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 

129821 AB028945 Ha 12696 cortadin SH3 domain-binding protein 

130036 BE061916 Hs.125849 chromosome 8 open reading frame 2 

130057 AF027153 Hs.324787 solute carrier family 5 (Inositol transp 

130095 AKD01635 Hs.14838 hypothetical protein FLJ10773 

130343 AB040914 Hs.278628 K1AA1481 protein 

130385 AW067800 Hs.155223 stanniocaldn 2 

130407 BE385099 Hs.334727 hypothetical protein MGC301 7 

130441 U63630 Hs.155637 protein kinase, DNA-actlvated, catalytic 

130455 D90O41 Hs.155956 N-acetyltransferase 1 (aryfamlne N-acety 

130604 AA383256 Hs.1657 estrogen receptor 1 

130617 M90516 Hs.1674 glutamine-frudose-6-phosphatetransamin 

130712 AJ271881 Hs.279762 bromodomain-containing 7 

131148 AW953575 Hs.303125 p53-induced protein PIGPC1 

131388 NM_O1481QHs.92200 KIAA0480 gene produd 

131564 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 fis, done PL 

131742 AA961420 Hs.31433 ESTs 

131877 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 

131985 AA5O302O Hs.36563 hypothetical protein FU2241 8 

132316 U28831 Hs.44566 KIAA1641 protein 

132528 T78736 Hs.50758 SMC4 (structural maintenance of chromoso 

132742 AA025480 Hs.292812 ESTs, Weakly similar to T33468 hypotheti 

132990 X77343 Hs.334334 transcription fadorAP-2 alpha (adivat 

133015 AJ002744 Hs.246315 UDP-N-acetykJpha^aladosamfr)e:polyp 
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25.1 


288 


12 


6.7 


6,3 


63 


1 


5.8 


3.6 


402 


112 


4.9 


3.8 


425 


111 


4 


5.7 


567 


100 


6.7 


9.6 


124 


13 


9 


12.3 


129 


11 


11.7 


24.2 


242 


10 


5.6 


6.7 


67 


1 


6.3 


13J 


137 


1 


8.9 


9.1 


91 


1 


7.6 


35.9 


359 


10 


29.7 


11,5 


115 


1 


6.9 


5.8 


58 


1 


5 


7.6 


144 


19 


13.9 


6.2 • 


62 


1 


5.4 


1.8 


1047 596 


1.6 


22.8 


228 


9 


12.4 


3.9 


322 


83 


4.4 


17.4 


174 


9 


6.9 


14.5 


145 


1 


2.4 


8.2 


82 


1 


6.4 


3.5 


2073 595 


21 


8.5 


127 


15 


1.6 


10.3 


103 


1 


9.3 


Z9 


214 


74 


3.7 


7.2 


72 


1 


5.7 


9.9 


351 


36 


13.9 


8.5 


65 


1 


4.3 


3.9 


60 


"16 


4.8 


5.8 


321 


55 


17 


10.4 


880 


85 


5.3 


105 


105 


1 


9.9 


13.1 


131 


1 


5.1 


6.7 


67 


1 


6 


30.6 


306 


4 


26.5 


7.5 


75 


1 


6.5 


7.2 


72 


1 


5.8 


17.4 


409 


24 


7.8 


7.1 


392 


56 


3.6 


8.2 


82 


1 


7.4 


7.1 


71 


1 


6.2 


9.5 


95 


1 


8.5 


7.1 


150 


21 


14.5 


11.4 


114 


1 


10 


6.7 


67 


1 


5.7 


1 


1 


1 


1 


14.6 


219 


15 


7.6 


13.2 


331 


25 


124 


72.2 


722 


1 


1.9 


6.5 


65 


4 


5.3 


6.1 


61 


1 


5.7 


10.8 


706 


66 


9.2 


32.2 


322 


1 


4.7 


10 


100 


1 


7.6 


17.5 


175 


2 


12.8 


3.8 


585 


153 


3.7 


7.6 


76 


1 


5 


4.7 


381 


81 


6.4 


11.7 


117 


1 


10.1 


6.8 


68 


1 


5.6 


40.2 


402 


1 


4 


18.6 


186 


10 


1.5 


9.3 


93 


1 


8.4 


6.5 


65 


1 


5.6 


12.7 


311 


25 


24 


4.6 


427 


93 


10.4 
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133199 AF231981 Hs.250175 homolog of yeast long chain polyunsatura 

133240 AKQ01489 Hs.242894 ADP-ribosytation factor-like 1 

133271 Z48633 Hs.283742 H.sapiens mRNA for retrotransposon 

133640 AW246428 Hs.75355 ublquitin-conjugaling enzyme E2N (homolo 

5 133746 AW410035 Hs.75862 MAO (mothers against decapentaplegic, Or 

133999 AA535244 Hs.783Q5 RAB2, member RAS oncogene family 

134110 U41060 Hs.79136 UV-1 protein, estrogen regulated 

134485 X82153 Hs.83942 cathepsin K (pycnodysoslosis) 

134654 AK001741 Hs.8739 hypothetical protein FU1Q879 

10 134880 AI879195 Hs.90606 15 kDa selenoprotein 

135029 H58818 Hs.187579 hydroxysteroid (17-beta) dehydrogenase 7 

135389 U05237 Hs.99872 fetal Alzheimer antigen 

128305 AI954968 Hs.279009 matrix G!a protein 



3 


816 


275 


3.9 


8.1 


81 


1 


4.6 


12.4 


124 


6 


10.8 


8.5 


85 


1 


72 


9.3 


93 


1 


7.8 


7.8 


78 


1 


5.6 


4.5 


1472 330 


2.1 


34.3 


411 


12 


5.1 


6.4 


64 


1 


5.1 


5.7 


57 


1 


5 


11.5 


115 


1 


10 


20.6 


206 


4 


19.1 


9.4 


94 


3 


5.3 
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TABLE 11A 



Table 11A shows the accession numbers for those pkeys lacking unigeneDD's for Table 1 1. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Accession: Genbank accession numbers 
15 

Pkey CAT number Accession 

123619 371681 1 AA602964 AA6O9200 
20 113702 genbank J97307 T97307 
114988 genbank_AA251089 AA251089 



10 



Pkey: 

CAT number 



Unique Eos probeset identifier number 
Gene duster number 
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TABLE 12: Figure 12 from BRCA 001-3 PCT 



5 Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 

R2: Ratio of 90* percentile tumor to body 

1 5 R3: Ratio of 75 D > percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 

Pkey ExAccn UnigenelD Unigene Title 

20 100131 D12485 Hs.11951 phosphodiesterase I (PC-1) 

105500 AW602166 Hs.222399 ESTs 

112244 AB029000 Hs.70fi23 K1AA1077 protein 

114124 W57554 Hs.125019 ESTs 

119771 AI905687 Hs.2533 ESTs 
25 121723 AA243499 Hs.104800 ESTs 

128790 AF026692 Hs.1O570O secreted frizzied-related protein 4 

131148 AW953575 Hs.303125 ESTs 

131985 AA503020 Hs.36563 ESTs 

133199 AF231981 Hs.250175 Homo sapiens clone 23904 mRNA sequence 3 



R1 


R2 


R3 


R4 


13.2 


244 


19 


9.9 


25.4 


508 


20 


3 


5.7 


567 


100 


6.7 


24.2 


242 


10 


5.6 


3.5 


2073 


595 


2.1 


2.9 


214 


74 


3.7 


17.4 


409 


24 


7.8 


3.8 


585 


153 


3.7 


40.2 


402 


1 


4 


3 


816 


275 


3.9 
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TABLE 13: Table 1 from BRCA 001-5 US 
5 Table 13 depicts a preferred group of genes upregulated in breast cancer cells. 
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Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title ■ 



R1: 



Ratio of tumor to normal body tissue 



Pkey 


ExAccn 


UnlGene ID 


100038 


M97935 




100039 


M97935 




100040 


M97935 




100041 


M97935 




100082 


AB003103 


Hs.4295 


100091 


AF000177 


Hs.1 11783 


100100 


AF006084 


Hs.11538 


100103 


AF007875 


Hs.5085 


100114 


D00596 


Hs.82962 


100121 


010495 


Hs.155342 


100123 


D10523 


Hs.168669 


100128 


D11094 


Hs.61153 


100131 


D12485 


Hs.11951 


100137 




Hs.15071 


100144 


D13643 


Hs.75616 


100147 


D13666 


Hs.136348 


100154 


D14657 


Hs.81892 


100164 


D14812 


Hs.173714 


100169 


D14878 


Hs.82043 


100190 


D21090 


Hs.178658 


100203 


D25538 


Hs.172199 


100209 


D26308 


Hs.76289 


100215 


D26598 


Hs.82793 


100216 


D26599 


Hs.1390 


100219 


028137 


Hs.118110 


100227 


D28915 


Hs.82316 


100248 


D31888 


Hs.78398 


100287 


D43950 


Hs.1600 


100294 


049395 


Hs.75454 


100307 


D50525 


Hs.699 


100335 


D63391 


Hs.6793 


100340 


063487 


Hs.82563 


100355 


D78129 


HsJ1465 


100363 


078514 


Hs.78563 


100368 


D79987 


Hs.153479 


100372 


D79997 


Hs.184339 


100375 


D800O4 


Hs.75909 


100379 


082060 


Hs.278721 


100387 


083777 


Hs.75137 


100393 


084145 


Hs.39913 


100398 


084557 


Hs.155462 


100405 


D85425 


Hs.82733 


100406 


086479 


Hs.118397 


100409 


D86957 


Hs.80712 


100421 


086985 


Hs.79276 


100446 


D87464 


Hs.10037 


100447 


D87465 


Hs,74583 


100448 


087469 


Hs.57652 



control 
control 
control 
control 

proteasome (prosome; macropain) 26S sub 
Lsm1 protein 

actin related protein 2/3 complex; subunit 

dolfchyl-phosphate mannosyltransferase p 

thytrodylate synthetase 

protein kinase C; delta 

oxoglutarate dehydrogenase (lipoamide) 

proteasome {prosome; macropain) 26S sub 

phosphodiesterase l/nucleotide pyrophosp 

chapefonin containing TCP1; subuni' 8 ft 

Human mRNA for KIAA0018 gene; comp 

osteoblast specific factor 2 (fasciclin Mike 

K1AA0101 gene product 

MORF-related gene X 

D123 gene product 

RAD23 (S. cerevisiae) homolog B 

adenylate cyclase 7 

biliverdin reductase B (flavin reductase (N 
proteasome (prosome; macropain) subunit 
proteasome (prosome; macropain) subunit 
bone marrow stromal cell antigen 2 
Interferon-induced; hepatitis C-assocJated 
K1AA0071 protein 

chaperonin containing TCP1; subunit 5 (e 
antioxidant protein 1 
hypothetical protein 

platelet-activating factor acetylhydrolase; 
KIAA0153 protein 

Homo sapiens mRNA for squalene epoxid 
ubiquitin-conjugating enzyme E2G 1 (horn 
extra spindle poles; S. cerevisiae; homolo 
KIAA0175 gene product 
KIAA0182 protein 

Ke4 gene; mouse; human homolog of 
KIMQ193 gene product 
novel RGD-containing protein 
minichromosome maintenance deficient (m 
nidogen 2 

AE-binding protein 1 
KIAA0202 protein 

Human mRNA for KIAA0232 gene; comp 
KIAA0274 gene product 
KIAA0275 gene product 
EGF-Tike-domaln; multiple 2 
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100467 


D89052 


Hs.7476 


ATPase; H+ transporting; lysosomal (vacu 


7.5 


100468 


D89289 


Hs.118722 


fucosyl transferase 8 (alpha (1;6) fucosyltr 


5 


100486 


HT1112 


Hs.10842 


Ras-Uke Protein Tc4 


16.9 


100497 


HT1400 


Hs.79137 


Carboxyl Methyltransferase, Aspartate, A 


5.6 


100618 


HT2710 


Hs.1 14599 


Collagen. Type Viii, Alpha 1 


7.5 


100661 


HT3018 


HS.132748 


Ribosomat Protein L39 Homolog 


4.4 


100667 


HT3127 


Hs.169610 


Epican, Alt. Splice 11 


4.6 


100668 


HT3938 


Hs. 169610 


Epican, AIL Splice 12 


4.4 


100676 


HT3742 


Hs.287820 


Rbronectin, AIL Splice 1 


9 


100775 


HT26388 


Hs.89603 


Mucin 1, Epithelial, Alt Splice 9 


4.7 


100783 


HT4018 


Hs.191356 


Basic Transcription Factor, 44 Kda Subun 


13.7 


100829 


HT4343 


Hs.278544 


Cytosolic Acetoacetyl-Coenzyme A Thb 


10.6 


100830 


HT4344 


Hs.4756 


Rad2 


5.5 


100840 


HT4392 


Hs.183418 


Protein Kinase Pitslre, Alpha, Alt Splice 


4.1 


100850 


HT417 


Hs.297939 


Caihepsin B 


4 


100866 


HT4582 


Hs.75113 


Transcription Factor Ilia 


4.9 


100906 


HT5158 


Hs.5398 


Guanosine ^-Monophosphate Synthase 


8.7 


100914 


HT511 


Hs.324178 


Ras Inhibitor Inf 


7.2 


100916 


HT544 


Hs.73946 


Endothelial Cell Growth Factor 1 


5.9 


100945 


HT884 


Hs.1 80686 


Oncogene E6-Ap, Papillomavirus 


4.6 


100975 


J02923 


Hs.76506 


lymphocyte cytosolic protein 1 (Uplastin 


30.1 


100988 


J03589 


Hs.76480 


ubtquitin-fike 4 


8.3 


100996 


J03909 


Hs.14623 


interferon; gamma-inducible protein 30 


6.9 


100999 


J03934 


Hs.80706 


dtaphorase (NADH/NADPH) (cytochrom 


4.3 


101011 


J04430 


Hs.1211 


acid phosphatase 5; tartrate resistant 


5.9 


101017 


J04599 


Hs.821 


bigtycan 


5.1 


101031 


J05070 


Hs.151738 


matrix metaltoprateinase 9 (gelatinase B; 


372 


101038 


J05249 


Hs.79411 


replication protein A2 (32kD) 


6,1 


101054 


K02405 


Hs.73931 


Human MHC ctess « HLA-OOoeta mRN 


A3 


101061 


K03515 


Hs.180532 


glucose phosphate isomerase 


4.3 


101091 


L06132 


Hs.149155 


voltage-dependent anion channel 1 


7.4 


101097 


L06797 


Hs.89414 


chemokine (C-X-C mottf); receptor 4 (fus 


4.6 


101104 


L07615 


Hs.169266 


Human neuropeptide Y receptor Y1 (NPY 


18.3 


101143 


L12723 


Hs.90093 


heat shock 70kO protein 4 


17.4 


101152 


L13800 


Hs.9884 


Homo sapiens liver expressed protein gen 


7.6 


101183 


L19779 


Hs.795 


H2A histone family; member 0 


10.9 


101216 


L25876 


Hs.84113 


cyciin-dependent kinase Inhibitor 3 (CDK 


7.4 


101233 


L290Q8 


Hs.878 


sorbitol dehydrogenase 


14.6 


101247 


L33801 


Hs.78802 


glycogen synthase kinase 3 beta 


7.5 


101282 


L38810 


Hs.79387 


proteasome (prosome; macropain) 26S sub 


4.4 


101326 


L42572 


Hs.78504 


inner membrane protein; mitochondrial (m 


5.8 


101332 


U7276 


Hs.156346 


Homo sapiens (cell Cne HL-6) alpha topo 


18.9 


101348 


L77213 


Hs.30954 


phosphomevalonate kinase 


7.5 


101352 


L77701 


Hs.16297 


COX17 (yeast) homolog; cytochrome c ox 


9.3 


101378 


M13755 


Hs.833 


mterferon-stimulaied protein; 15 kDa 


18.1 


101396 


M15796 


Hs.78996 


proliferating ceD nuclear antigen 


8.6 


101404 


M16342 


Hs.182447 


heterogeneous nuclear riborojdeoprotein C 


4.5 


101439 


M20902 


Hs.268571 


apoCpoprotein OA 


6.1 


101464 


M22538 


Hs.51299 


NADH dehydrogenase (ubiquinone) ftavo 


8.7 


101469 


M22877 


Hs.169248 


Human somatic cytochrome c (HCS) gene 


4.2 


101472 


M22960 


Hs.1 18126' 


protective protein for beta-galactosidase ( 


6.5 


101478 


M23379 


Hs.758 


RAS p21 protein activator (GTPase activa 


14 


101484 


M24594 


Hs.20315 


interferon-mduced protein 56 


92 


101539 


M30818 


Hs.926 


myxovlrus (influenza) resistance 2; homo! 


5.1 


101540 


M30938 


Hs.84981 


X-ray repair complementing defective rep 


4.7 


101544 


M31169 




Human propionyi-CoA carboxylase beta-s 


5.5 


101552 


M31642 


Hs.82314 


hypoxanthlne phosphoribosyltransferase 1 


8.5 


101580 


M34677 


Hs.83363 


DNA segment on chromosome X (unique) 


4.5 


101600 


M37583 


Hs.119192 


H2A histone family, member Z 


5.7 


101663 


M60750 


Hs.2178 


H28 histone family; member A 


5.8 


101664 


M60752 


Hs.121017 


H2A histone family; member A 


135 


101667 


M60858 


Hs.79110 


nudeolin 


4 


101684 


M63256 


Hs.75124 


cerebellar degeneration-related protein (62 


7.6 


101702 


M64929 


Hs.179574 


protein phosphatase 2 (formerly 2A); regu 


4.2 


101754 


M77142 


Hs.239489 


TiAl cytotoxic granule-associated RNA-b 


4.5 


101758 


M77836 


Hs79217 


pyrrofine-5-carboxytate reductase 1 


5.7 


101767 


M81057 


Hs.180884 


carboxypeptidase B1 (tissue) 


21.7 
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101770 M81601 
101791 M83822 
101803 M86546 
101809 M86849 
101839 M93036 
101851 M94250 
101888 M99701 
101973 S82597 
101991 U00968 
102009 U02680 
102025 U03911 
102047 U07158 
102051 U07550 
102083 U10323 
102095 U11313 
102130 U15009 
102133 U15173 
102148 U16954 

102179 U19713 

102180 U19718 
102193 U20758 
102198 U21Q9Q 
102202 U21931 
102209 U22970 
102211 U23070 
102220 U24389 
102224 U24704 
102234 U26312 
102250 U28014 

102260 U28386 

102261 U28488 
102273 U30B88 
102238 u 32343 
102302 U33052 
102305 U33286 
102320 U34683 
102330 U35451 
102348 U37519 

102361 U39400 

102362 U39412 
102369 U39840 
102395 U41767 
102409 U43286 
102418 U43923 
102425 U44772 
102457 U48807 
102465 U49352 
102495 U51240 
102534 U56833 
102546 U57877 
102549 U58046 
102557 U58766 
102562 U59309 
102568 U59877 

102580 U60808 

102581 U61145 

102590 U62136 

102591 U62325 

102592 U62389 

102617 U65928 

102618 U65932 
102638 U67319 
102663 U70322 

mm U706&) 

102679 U72661 
102687 U73379 
102704 U76638 



Hs.78869 

Hs.62354 

Hs.155691 

Hs.323733 

Hs.692 

Hs.82045 

Hs.95243 

Hs.80120 

Hs.166 

Hs.82643 

Hs.78934 

Hs.83734 

Hs.1197 

Hs.75117 

Hs.75760 

Hs.1575 

Hs.155596 

Hs.75823 

Hs.76364 

Hs.83551 

Hs.313 

Hs.74598 

Hs.574 

Hs.265827 

Hs.78776 

Hs.65436 

Hs.148495 

Hs.276554 

Hs.74122 

Hs.1 59557 

Hs.155935 

Hs.75981 

H5.54483 

Hs.69171 

Hs.90073 

Hs.82327 

Hs.77254 

Hs.87539 

Hs.75859 

Hs.75932 

Hs.299867 

Hs.92208 

HS.U8725 

Hs.79058 

Hs.3873 

Hs.2359 

Hs.81548 

HS79356 

Hs.198307 

Hs.3577 



Hs.264428 

Hs.75653 

Hs.223025 

Hs.152981 

Hs.77256 

Hs.79300 

Hs.324125 

Hs.11223 

Hs.198767 

Hs.81071 

Hs.9216 

Hs.168075 

Hs.279910 

Hs.11342 

Hs.93002 

Hs.54089 



transcription elongation factor A (SIJ); 1 

cell division cycle 4-llke 

pre-B-cell leukemia transcription factor 1 

Homo sapiens connexin 26 (G JB2) mRNA 

membrane component; chromosomal 4; su 

midkine (neurite growth-promoting factor 

transcription elongation factor A (SHJ* 

UDP-N-acetyl-aIpha-D^alactosamine:po 

Human SREBP-1 mRNA; complete cds 

protein tyrosine kinase 9 

mutS (E. col]) homolog 2 (colon cancer; n 

syntaxin 4A (placental) 

heat shock 10kD protein 1 (chaperonin 10 

interteukin enhancer binding factor 2; 45k 

sterol carrier protein 2 

small nuclear ribonucleoprotein 03 polyp 

BCL2/adenovirus E1B 19kD-interacting p 

ALU-fused gene from chromosome 1q 

allograft inflammatory factor 1 

microfibnllar-associated protein 2 

secreted phosphoproteln 1 (osteopontin; b 

polymerase (DMA directed); delta 2; regu 

fructose-bisphosphatase 1 

interferon; afpha-lnducible protein (clone 

putative transmembrane protein 

fysyloxidase-fikel 

proteasome (prosome; macropain) 26S sub 
chromobox homolog 3 (Orosophfla HP1 g 
caspase 4; apoptosis-related cysteine prot 
karyopherin alpha 2 (RAG cohort 1; impo 
complement component 3a receptor 1 
ubiquitin specific protease 14 (tRNA-guan 

protein kinase C-iike 2 
chromosome segregation 1 (yeast homolo 
glutathione synthetase 
chromobox homolog 1 (Drosophila HP1 b 
aldehyde dehydrogenase 8 
chromosome 11 open reading frame 4 
N-ethylmaleimide-sensitive factor attachm 
hepatocyte nuclear factor 3; alpha 
a disintegrin and metalioproteinase domai 
seienophosphate synthetase 2 
suppressor of Ty (S.cerevisiae) 4 homolog 
palmitoyl-protein thtoesterase (ceroid-lipo 
dual specificity phosphatase 4 
2;4-dienoyl CoA reductase 1; mitochondri 
Lysosomal-associated multispanning mem 
von Hippel-Undau binding protein 1 
succinate dehydrogenase complex; subuni 
eukaryotic translation InttiaOon factor 3; s 
tissue specific transplantation antigen P35 



RAB31; member RAS oncogene family 
CDP-diacylgrycero) synthase (phosphatid 
enhancer of zeste (Drosophila) homolog 2 
Homo sapiens enterocyte differentiation a 
amyloid beta (A4) precursor proteln-bindi 
Human putative cytosollc NADP-depende 
Jun activation domain binding protein 
extracellular matrix protein 1 
caspase 7; apoptosis-related cysteine prot 
karyopherin flmportin) beta 2 
ATX1 (antioxidant protein 1; yeast) homo 
ninjurin 1; nerve injury-induced proteln-1 
ubfcjuitin carrier protein E2-C 
BRCA1 associated RING domain 1 
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102907 
102919 
102929 
102973 
102983 
102985 
103003 
103018 
103023 
103036 
103050 
103073 
103075 
103080 
103094 
103105 
103121 
103149 
103180 
103182 
103188 
103191 
103193 
103194 
103195 
103206 
103207 
45 103208 
103216 
103226 
103230 
103262 
5.0 103278 
103297 
.103302 
103316 
103330 
55 103349 
103352 
103364 
103374 
103380 
60 103395 
103402 
103410 
103420 
103421 
65 103427 
103430 
103438 



U77180 
U79241 
U79254 
U79282 
U79293 
U82130 
U86602 
U87269 
U896G6 
U90426 
U90904 
U90914 
U91327 
U94592 
U95006 
U96113 
X02419 
X06985 
X12447 
X13238 
X16663 
X17620 
X17644 
X52003 
X53296 
X53793 
X54925 
X57766 
X59417 
X59543 
X59798 
X60787 
X61970 
X63679 
X66363 
X69433 
X69819 
X70040 
X70218 
X70476 
X70649 
X70940 
X72755 
X72790 
X72841 
X74262 
X75042 
X75861 
X78565 
X79882 
X81786 
X82103 
X83301 
X85373 



X90872 
X91788 
X92396 
X94754 
X95404 
X96506 
X97065 
X97074 
X97303 
X97544 
X98263 



Hs.50002 

Hs.1 18666 

Hs.181311 

Hs.155572 

Hs.159264 

Hs.1 18910 

Hs.74407 

Hs.154196 

Hs.38041 

Hs.179606 

Hs.83724 

Hs.5057 

Hs,6456 

Hs.80658 

Hs.37616 

Hs.324275 

Hs.77274 

Hs.202833 

Hs.74649 

Hs.14601 

Hs.1 18638 

Ha.2707 

Hs.1406 

Hs.81134 

Hs.1 17950 

Hs.83169 

Hs.155324 

Hs.74077 

Hs.2934 

Hs.82932 

Hs.296281 

Hs.76913 

Hs.4147 

Hs.171834 

Hs.5337 

Hs.99995 

Hs.2942 

Hs.2903 

Hs.75724 

Hs.78580 

Hs.2642 

Hs.77367 

Hs.31314 

Hs.16003 

Hs.44313 

HS.74637 

Hs.289114 

Hs.80680 

Hs.9078 

Hs.3059 

Hs.324728 

Hs.77496 

Hs.78853 

Hs.279929 

Hs.84974 

Hs.24167 

Hs.279946 

Hs.180370 

Hs.295362 

Hs.173497 

Hs.1 19591 

Hs.250655 

Hs.20716 

Hs.152720 



small inducible cytokine subfamily A (Cy 
Human done 23759 mRNA; partial cds 
asparaginyl-tRNA synthetase 
Human done 23801 mRNA sequence 
Human done 23948 mRNA sequence 
tumor susceptibility gene 101 
nudeolar protein p40 
E4F transection tactcr 1 
pyridoxal (pyridoxine; vitamin B6) kinase 
nuclear RNA heBcase; DECO variant of D 
Human done 23773 mRNA sequence 
carboxypeptidaseO 

chaperonin containing TCP1; subunit 2 (b 
Human uncoupling protein homoiog (UCP 
Human D9 splice variants mRNA; comp 
Homo sapiens Nedd-4-tike ubiquitin-prot 
plasminogen activator; urokinase 
heme oxygenase (decycDng) 1 
aldolase A; frudose-bisphosphate 
cytochrome c oxidase subunit vie 
hematopoietic cell-specific Lyn substrate 
non-metastatic cells 1; protein (NM23A) 
G1 to S phase transition 1 
trefoil factor 1 (breast cancer; estrogen-ind 
Interleukin 1 receptor antagonist 
multifunctional polypeptide similar to SA 
matrix metaKoproteinase 1 (interstitial col 
matrix metaJtoproteinase 1 1 (stromelystn 
proteasome (prosorne; macropain) subunit 
ribonucleotide reductase Ml potypepfide 
cyclin D1 (PRAD1: parathyroid adenomat 
interleukin enhancer binding factor 1 
proteasome (prosorne; macropain) subunit 
translocating chain-associating membrane 
PCTAIRE protein kinase 1 
Isodtrate dehydrogenase 2 (NADP+); mit 
Intercellular adhesion molecule 3 
macrophage stimulating 1 receptor (c-met 
protein phosphatase 4 (formerly X); cataly 
coatomer protein complex' subunit beta 2 
DEAD/H (Asp-Glu-Ala-Asp/HIs) box pol 
eukaryotic translation elongation fador 1 
monokine induced by gamma interferon 
Human endogenous retrovirus mRNA for 
retinoblastoma-binding protein 7 
retirrobtastorra -binding protein 4 
v-rel avian retlajtoendotheliosis viral onco 



hexabrachion (tenascin C; cytotactin) 
lung resistance-related protein 
Immature colon carcinoma transcript 1 
coatomer protein complex; subunit beta 
SMA5 

small nudear ribonudeoprotein polypepti 

serine/threonine kinase 9 

uraca-ONAgtycosylase 

SULT1C suifotransferase 

chloride channel; mideofele'Sensitive; 1A 

synaptobrevin-Gke 1 

methionlne-tRNA synthetase 

cofilin 1 (non-muscle) 

DR1-assodated protein 1 (negative cofact 

Sec23 (S. cerevisiae) homoiog 8 

adaptor-related protein complex 2; slgma 

H.sapiens mRNA for Ptg-12 protein 

translocase of Inner mitochondrial membr 

M-phase phosphoproteln 6 
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103464 Y00285 Hs.76473 insulin-like growth factor 2 receptor 

103470 Y00796 Hs.1 74103 integrin; alpha L (antigen CD11 A {p1 80); 

103494 Y08991 Hs.83050 phosphatidylinositol 3-kinase-associated p 

103505 Y09912 Hs.33102 transcription factor AP-2 beta (activating 

5 103547 Z14982 Hs.180062 proteasome (proscme; macropain) subun'rt 

103551 Z15115 Hs.75248 topoisomerase (DNA) il beta (180kD) 

103565 Z22548 Hs.146354 thtoredoxln-dependent peroxide reductase 

103587 229083 Hs.82128 5T4 oncofetal trophobiast glycoprotein 

103621 Z47727 Hs.150675 polymerase ^NA) II (ONA directed) pol 

10 103622 Z48Q42 Hs.278672 membrane component chromosome 11; s 

103658 Z74615 Hs.172928 collagen; type I; alpha 1 

103680 Z93784 Homo sapiens DNA sequence from PAC 

103772 AA092473 Hs.278554 chromobox homolog 3 (Drosophila HP1 g 

103774 AA092898 Hs.92918 ESTs; Weakly similar to R07G3.8(C.eleg 

15 103821 AA157623 Hs.198793 KIAA0750 gene product 

103835 AA172215 Hs,93748 ESTs; Moderately similar to TRANSCRIP 

103886 AA236384 Hs.105737 ESTs; Weakly similar to gene 9306 protei 

103890 AA236843 Hs,72085 ESTs; Weakly similar to unknown (S,cere 

103892 AA243523 Hs.239189 ESTs 

20 104054 AA393432 Hs.7100 hypothetical protein 

104115 AA428090 Hs.26102 ESTs 

104136 AA442669 Hs.268371 zv68f6.r1 SoaresJotaLfetus_Nb2HF8_9w 

104147 AA451992 Hs.283037 ESTs; Highly similar to HSPC039 protein 

104173 AA476564 Hs.76561 ESTs; Weakly similar to finger protein HZ 

25 104181 AA479521 Hs.283740 ESTs 

104183 AA480838 Hs.114309 ESTs 

104192 AA486946 Hs.21321 Homo sapiens mRNA; cDNA DKFZp564 

104209 ABD00221 Hs.16530 small inducible cytokine subfamily A (Cy 

104234 AB002357 Hs. 168212 kinesln family member 3B 

30 104271 C01687 Hs.7381 ATP synthase; H+ transporting; mKochon 

104278 C02582 Hs. 109253 ESTs; Highly similar to N-termfnal acetyl 

104307 D52818 Hs.111680 endosulfine alpha 

104309 D55869 Hs.284123 Homo sapiens mRNA full length insert cO 

104370 H19378 Hs.21851 Homo sapiens mRNA; cONA DKFZp586 

35 104446 L44497 Hs.7351 ESTs 

104453 M19169 Hs.123114 cystatinSN 

104476 N33807 Hs,324275 protease; serine; 15 

104558 R56678 Hs.88959 Human DNA sequence from clone 967N2 

104592 R81003 Hs.325820 serine protease; umbilical endoMum 

40 104634 AA004274 Hs.19151 ESTs 

104636 AA004415 Hs.106106 ESTs 

104658 AA007145 Hs.27268 Homo sapiens mRNA; cDNA DKFZp564 

104667 AA007234 Hs.30098 ESTs 

104675 AA009596 Hs.301553 ESTs; Moderately similar to 111! ALU SU 
45 104767 AA025534 Hs.8852 . ESTs 

104785 AA027163 Hs.7942 ESTs 

104791 AA029046 Hs.301871 ESTs; Moderately similar to cAMP indue 

104804 AA031357 Hs.31803 ESTs; Weakly similar to N-WASP (H.sap 

104807 AA032147 Hs.23296 ESTs 

50 104837 AA039469 Hs.21126 ESTs; Weakly similar to KIAA0299(H.s 

104849 AA040270 Hs.241507 Homo sapiens mRNA; cDNA DKFZp564 

104867 AA045481 Hs.225979 Human gene from PACs 37M1 7 and 305B 

104884 AA053021 Hs. 14511 SCO (cytochrome oxidase deficient; yeast 

104906 AA055809 Hs.26802 ESTs; Weakly similar to phosphoprotein [ 

55 104919 AA057193 Hs.25252 ESTs 

104921 AA057839 Hs.1508 ESTs 

104926 AA058846 Hs.33363 DKF2P434N093 protein 

104938 AA064627 Hs,318725 ESTs; Highly similar to CGl-72 protein (H 

104943 AA065217 Hs.114218 ESTs 

60 104957 AA074919 Hs.10026 ESTs; Weakly similar to ORF YJL063C [S 

104961 AA076672 Hs.33905 ESTs 

104968 AA084602 Hs.29669 ESTs 

104975 AA086071 Hs.50758 chromosome-assodated polypeptide C 

104977 AA088228 Hs.18272 ESTs 

65 104978 AA088458 Hs.19322 ESTs 

104987 AA101723 Hs.11861 ESTs 

1(^002 AA1 13266 Hs.182704 ESTs; Moderately similar to alternatively 
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105012 AA116036 


Hs.9329 


chromosome 20 open reading frame 1 


10.7 


105019 AA121879 


Hs.9280 


proteasome (prosome; macropain) subunit 


5.7 


105029 AA126855 


Hs.13268 


ESTs 


4.4 


105033 AA127964 


Hs.274329 


TP53 target gene 1 


6.3 


105035 AA128466 


Hs.8859 


ESTs 


6.5 


105039 AA130349 


Hs.36475 


ESTs 


4 


105062 AA134968 


Hs.36529 


ESTs 


4.3 


105076 AA142858 


Hs.37810 


ESTs 


6.4' 


105087 AA147884 


Hs.9812 


ESTs 


9.2 


105091 AA148859 


Hs.179909 


ESTs; Weafdy similar to till ALU SUBFA 


5.7 


105093 AA149051 


Hs.32405 


ESTs 


6.3 


105107 AA152302 


Hs.25035 


OKFZP566G223 protein 


6.2 


105127 AA158132 


Hs.301957 


ESTs; Weakly similar to contains similar! 


5.7 


105132 AA159501 


Hs.247280 


HBV associated factor 


4.2 


105143 AA165333 


Hs.24808 


ESTs 


4.7 


105154 AA171736 


Hs.35947 


methyl-CpG binding domain protein 4 


9 


105162 AA176690 


Hs.4084 


KtAA1025 protein 


9.1 


105186 AA191512 


Hs.28005 


Homo sapiens mRNA; cONA DKFZp564 


19.3 


105209 AA205072 


Hs.227743 


KIAA0980 protein 


7,4 


105223 AA211388 


Hs.7750 


ESTs 


5.1 


105252 AA227428 


Hs.9728 


ESTs; Weakly similar to WAAQ51 2 prate 


11.1 


105253 AA227448 


Hs.5003 


K1AA0456 protein 


6.4 


105261 AA227871 


Hs.6361 


MEK partner 1 


9.1 


105263 AA227926 


Hs.6682 


ESTs 


6.7 


105274 AA228122 


Hs.281866 


ATPase; H+ transporting; lysosomal (vacu 


5.3 


105297 AA233451 


Hs.183858 


transcriptional intermediary factor 1 


8.7 


105309 AA233790 


Hs.4104 


ESTs; Weakly simaar to cONA EST yk38 


7.4 


105312 AA233854 


Hs.23348 


S-phase kinase-assoclated protein 2 (p45) 


5.8 


105342 AA235286 


Hs.157078 


ESTs 


4.5 


105376 AA236559 


Hs.8768 


ESTs; Weakly similar to till ALU SUBFA 


5.8 


105386 AA236950 


Hs.8115 


ESTs 


5.5 


105397 AA242868 


Hs.7395 


ESTs; Weakly similar to house-keeping p 


7.7 


105399 AA243007 


Hs.16420 


ESTs; Highly similar to SH3 domain-bind 


5.6 


105400 AA243052 


Hs.65648 


RNA binding motif protein 8 


5.8 


105404 AA243303 


Hs.21187 


ESTs 


9.1 


105409 AA243562 


Hs.301855 


ESTs 


4.4 


105436 AA252172 


Hs.237856 


ESTs; Moderately similar to cAMP Indue 


5.1 


105483 AA255874 


Hs.23458 


ESTs 


4.9 


105493 AA256268 


Hs.10283 


ESTs 


6 


105495 AA256317 


Hs.28785 


Homo sapiens mRNA; cONA DKFZp586 


5.2 


105496 AA256323 


Hs.301997 


DKFZP434N126 protein 


8.7 


105500 AA256485 


Hs.222399 


CGl-96 protein 


9.5 


105507 AA256678 


Hs.226318 


ESTs; Moderately similar to CCR4-associ 


4.1 


105538 AA258860 


Hs.32597 


ring finger protein {C3H2C3 type) 6 


4.1 


105544 AA261954 


Hs.24678 


ESTs 


8 


105546 AA262032 


Hs.268281 


ESTs; Weakly similar to 62D9.a p.melan 


8.1 


105549 AA262417 


Hs.5415 


ESTs - 


. - 4.6 


105551 AA262477 


Hs.25292 


ribonuclease HI; large subunit 


9.1 


105560 AA262783 


Hs.306915 


ESTs 


4.5 


105565 AA278302 


Hs.18349 


ESTs; WeaWy slmilarto partial COS [C.e 


4.2 


105566 AA276323 


Hs.17481 


Homo sapiens clone 24606 mRNA sequen 


11.9 


105575 AA278717 


Hs.12772 


ESTs 


5.9 


105584 AA279012 


Hs.3454 


ESTs; Weakly similar to KIAA0665 prote 


4.4 


105596 AA279418 


Hs.18490 


ESTs 


4 


105604 AA279787 


Hs.15467 


ESTs; Moderately similar to putative pho 


5.6 


105610 AA279991 


Hs.99872 


ESTs; Weakly similar to trithorax homoto 


5.3 


105621 AA280865 


Hs.6375 


Homo sapiens mRNA; cDMA DKFZp564 


4.8 


105627 AA281245 


Hs.23317 


ESTs 


75 


105638 AA281599 


Hs.247817 


Homo sapiens mRNA for for histone H2B 


5.9 


105645 AA282138 


Hs.11325 


ESTs 


6.4 


105650 AA282347 


Hs.25635 


ESTs; Highly slmflar to HSPC0O3 (H.sap 


11.3 


105666 AA283930 


Hs.34906 


ESTs 


4.7 


105674 AA284755 


Hs.279789 


CDW52 antigen (CAMPATH-1 antigen) 


8 


105687 AA286809 


Hs.28423 


ESTs 


7.1 


105700 AA287643 


Hs.35254 


ESTs; WeaWy sim3arto hypothetical pro 


4.9 


105705 AA290767 


Hs.101282 


Homo sapiens mRNA; cONA DKFZp434 


8 


105709 AA291268 


Hs.26761 


DKFZP586L0724 protein 


6.8 
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105731 AA292711 Hs.29131 ESTs 

105753 AA299789 Hs.1 10857 ESTs 

105774 AA348014 Hs.23412 ESTs 

105784 AA350771 Hs,17850 ESTs 

5 105791 AA358038 Hs.14368 SH34inding domain glutamic add-rich p 

105807 AA393803 Hs.16869 ESTs; Moderately similar to COLLAGEN 

105808 AA393808 Hs.286131 KIAA0438 gene product 

105812 AA394126 Hs.20814 ESTs; Highly similar to CGl-27 protein [H 

105813 AA394140 Hs.18585 ESTs 

10 105819 AA397920 Hs.28783 Homo sapiens mRNA; cONA DKFZp564 

105870 AA399623 Hs.101067 ESTs 

105874 AA400074 Hs.171118 ESTs 

105896 AA400999 Hs.7838 Human ring zinc-finger protein (ZNF127- 

105934 AA404248 Hs.16577 ESTs 

15 105935 AA404277 Hs.263727 ESTs; Weakly similar to bisphosphate 3'- 

105966 AA406105 Hs.5344 adaptor-related protein complex 1; gamma 

105974 AA406321 Hs.6224 K1AA0895 protein 

105990 AA410336 Hs.29403 ESTs; Weakly similar to PROBABLE AT 

105995 AA410510 Hs.5345 ESTs 

20 106000 AA410972 Hs.20726 ESTs 

106007 AA411462 Hs.11042 ESTs; Weakly similar to veU 1 [H^apiens 

106016 AA411819 Hs.8164 . KIAA0898 protein 

106034 AA412473 Hs.14928 ESTs 

106042 AA412700 Hs.169895 ubiquitin-conjugating enzyme E2L6 

25 106057 AA417067 Hs.289074 ESTs 

106065 AA417558 Hs.25206 ESTs 

106070 AA417761 Hs.5957 Homo sapiens clone 24416 mRNA sequen 

106103 AA421104 Hs.12094 ESTs 

106126 AA424006 Hs.22972 ESTs; Moderately similar to H5AR (M.m 

30 106154 AA425304 Hs.6994 ESTs 

106157 AA425367 Hs.34892 ESTs 

106166 AA425872 Hs.19561 NAOH dehydrogenase (ubiquinone) 1 alp 

106204 AA428024 Hs.21479 ESTs 

106210 AA428239 Hs.10338 ESTs 

35 106220 AA428582 Hs.32196 ESTs; Moderately similar to metargidin p 

106236 AA429951 Hs.21104 ESTs 

106240 AA430074 Hs.18552 ESTs; Weakly similar to Ylr218cp [S.cere 

106263 AA431462 Hs.28329 ESTs 

106288 AA435536 Hs.24336 ESTs 

40 106293 AA435591 Hs.301444 signal sequence receptor; gamma (transloc 

106310 AA436244 Hs,17240 ESTs 

106317 AA436568 Hs.108124 ESTs 

106328 AA436705 Hs.28020 K1AA0766 gene product 

106341 AA441798 Hs.5243 ESTs; Moderately similar to plL2 hypoth 

45 106348 AA442253 HS.1Q702 ESTs 

106350 AA442763 Hs.19469S cydinB2 

106371 AA443923 Hs.170310 ESTs 

106389 AA446949 Hs.6236 ESTs 

106394 AA447223 Hs.25320 Homo sapiens done 25142 mRNA sequen 

50 106426 AA448282 Hs.16206 ESTs; Weakly similar to F55C12.5 [C.ele 

106459 AA449741 Hs.4029 glioma-amplified sequence-41 

106462 AA449912 Hs.30532 ESTs; Highly similar to CGl-77 protein [H 

106463 AA450047 Hs.14770 ESTs 
106479 AA450351 Hs.75251 ESTs 

55 106494 AA452108 Hs.18387 transcription factor AP-2 alpha (activating 

106503 AA452411 Hs.29679 ESTs; Highly similar to mediator (H^aple 

106507 AA452584 Hs.267819 protein phosphatase 1; regulatory {inhibito 

106533 AA453786 Hs.145998 ESTs 

106568 AA455970 Hs.28285 patched related protein translocated in ren 

60 106586 AA456598 Hs.57787 ESTs 

106589 AA456646 Hs.28661 ESTs 

106606 AA457730 Hs.283437 Homo sapiens clone 23851 mRNA sequen 

106611 AA458904 Hs.26267 ESTs; Weakly similar to torsinA [H.sapte 

106614 AA458934 Hs.256150 ESTs 

65 106628 AA459657 Hs.12311 Homo sapiens done 23570 mRNA sequen 

106637 AA459961 Hs.250824 ESTs 

106644 AA460239 Hs.12680 ESTs 
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106664 AA460969 Hs7510 


mitogen-activated protein kinase kinase ki 


8.4 


106698 AA463745 Hs.29403 


ESTs; Weakly similar to PROBABLE AT 


5.3 


106719 AA465171 Hs.236844 


ESTs 


5.6 


108726 AA465339 Hs.3886 


ESTs 


10.1 


106747 AA476473 Hs.171957 


triple functional domain (PTPRF interact! 


10.4 


106759 AA477263 Hs.25584 


ESTs 


4.2 


106765 AA477717 Hs.306117 


Interieukin 13 receptor, alpha 1 


6.9 


106784 AA478558 Hs.227913 


APl&40ce1 


5.1 


106831 AA482014 Hs.29463 


centrin; EF-hand protein; 3 (C0C31 yeast 


5.1 


106836 AA482112 Hs.238707 


ESTs 


4.8 


106840 AA482548 Hs.5534 


ESTs 


10.3 


106856 AA486183 Hs.285123 


ESTs; Weakly similar to similar to oxyste 


6.2 


106865 AA487228 Hs.19479 


ESTs 


4.5 


106878 AA488872 Hs.12314 


Homo sapiens mRNA; cDNA DKFZp586 


7.9 


106888 AA489101 Hs^4734 


oxysterol binding protein 


6.4 


106895 AA489665 Hs.25245 


ESTs 


4.6 


106909 AA490323 Hs.250747 


SUMO-1 activating enzyme subunit 1 


4.2 


106919 AA490885 Hs 21766 


ESTs 


12.3 


106920 AA490899 Hs.296323 


ESTs 


6.2 


106941 AA496204 Hs.237971 


ESTs 


4 


106942 AA496347 Hs.31314 


retinobtastoma-blnding protein 7 


4.8 


106948 AA496788 Hs.21077 


KIAA0532 protein 


4 


106968 AA504631 Hs.26813 


ESTs; Weakly similar to hypothetical 43.2 


4.4 


106973 AA505141 Hs.11923 


Human ONA sequence from clone 167A1 


5.4 


106980 AA521 121 Hs.8858 


bromodomaln adjacent to zinc finger dom 


4.1 


106981 AA521157 Hs.74101 


ESTs 


5.7 


106998 AA598461 Hs.1 95464 


insufin-fike growth factor binding protein 


18.7 


107008 AA598710 Hs.23740 


ESTs 


6.2 


107028 AA599214 Hs.24143 


ESTs 


4.1 


107032 AA599472 Hs,247309 


sucdnate-CoA Dgase; GOP-forming; beta 


5.3 


107052 AA600134 Hs.12482 


glycerophosphate O-acyi transferase 


4.8 


107053 AA600147 Hs.5741 


ESTs; Weakly similar to NADH-cytochro 


5.8 


107056 AA600310 Hs.18720 


programmed cell death 8 {apoptosls-induc 


4.9 


107080 AA609210 Hs.19221 


ESTs 


8.4 


107102 AA609723 Hs.30652 


ESTs 


8 


107109 AA609943 Hs.32793 


ESTs 


9.5 


107129 AA620553 Hs:4756 


flap structure-specific endonuclease 1 


4.9 


107132 AA620598 Hs.9052 


ESTs 


5.3 


107136 AA620795 Hs.8207 


ESTs 


4 


107140 AA620889 Hs.170088 


ESTs 


6.7 


107151 AA621169 Hs.8687 


ESTs 


19 


107159 AA621340 Hs.10600 


ESTs; Weakly similar to ORF YKR081c [ 


8.1 


107174 AA621714 Hs.25338 


ESTs 


8.5 


107217 D51095 Hs.35861 


DKFZP586E1621 protein 


7.2 


107252 059971 Hs.25925 


ESTs 


7.9 


107295 T34527 Hs.80120 


UDP^-acetyl-alpha-0-galactosamine:po 


5.6 


107299 T40327 Hs.30661 


tung resistance-related protein 


8.4 


107324 T81665 Hs.278422 


OKF2P586G1 122 protein 


7.5 


107372 U85625 Hs.8297 


ribonuctease 6 precursor 


4.7 


107373 U85773 Hs.154695 


phosphomannomutase 2 


4.8 


107481 W58247 Hs.279766 


Homo sapiens kinesin supedamity motor 


6.3 


107531 Y13936 Hs.17883 


protein phosphatase 1G (formerly 2C); ma 


8.3 


107859 AA024835 Hs.47584 


potassium voltage-gated channel; delayed 


7.3 


107890 M026030 Hs.61311 


ESTs; Weakly similar to CALPAIN 2; LA 


7.3 


107908 AA026894 Hs.42826 


ESTs 


4.9 


108039 AA041341 Hs.46670 


ESTs 


5.4 


108040 AA041551 Hs.159971 


ESTs 


8.4 


108102 AA046424 Hs.49433 


ESTs; Weakly similar to HYPOTHET1CA 


6.6 


108217 AA058686 Hs.62588 


ESTs 


7.7 


108255 AA063157 Hs.172608 


ESTs 


4 


108358 AA071514 Hs.1634 


ESTs 


4 


108609 AA100694 Hs.69499 


Human DNA sequence from BAC 15E1 o 


5.5 


108647 AA1 12396 Hs.44276 


ESTs; Moderately similar to HOMEOBO 


14.3 


108676 AA115562 Hs.274417 


Homo sapiens mRNA; cDNA DKFZp564 


5.2 


108687 AA120785 Hs.54347 


ESTs 


5.6 


108695 M121315 Hs.70823 


K1AA1077 protein 


10.5 


108733 AA126422 


zn84f1.s1 Stratagene lung carcinoma 9372 


4.4 
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108774 AA128125 Hs.71040 ESTs; Moderately similar to CELL GROW 

108828 AA131584 Hs.273344 DKFZP564O0463 protein 

108872 AA134063 Hs.11 1680 ESTs 

108884 AA134958 Hs.293591 ESTs 

5 108893 AA135894 Hs.194691 refoiolc acid induced 3 

109008 AA156360 Hs.87128 ESTs 

109010 AA156460 Hs.44229 dual specificity phosphatase 12 

109011 AA156542 Hs.72127 ESTs 

109042 M159525 Hs.71779 Homo sapiens DNA from chromosome 19 

10 109086 AA166695 Hs.270737 tumor necrosis factor (lig and) superfamiiy 

109090 AA167006 Hs.70499 ESTs 

109101 AA167708 Hs.52184 ESTs 

109112 AA169379 Hs.257924 ESTs 

109160 AA179387 Hs.301997 DKFZP434N1 26 protein 

15 109166 AA179845 Hs.73625 RAB6 interacting; kinesin-like (rabkinesin 

109178 AA181600 Hs.283707 ESTs 

109179 AA181902 Hs.192789 ESTs; Weakiy similar to fill ALU SUBFA 
109261 AA195255 Hs.61779 ESTs 

109270 AA195515 Hs.3585 ESTs; Weakiy similar to alternatively spli 

20 109277 AA196332 Hs.86043 ESTs 

109313 AA206800 Hs.86276 ESTs; Moderately similar to zinc finger p 

109415 AA227219 Hs.110826 trinucleotide repeat containing 9 

109454 AA232255 Hs.295232 ESTs 

109467 AA232904 Hs.63187 ESTs 

25 109481 AA233342 Hs.289069 ESTs; Weakly similar to WD40 protein C 

109508 AA233892 Hs.55902 ESTs; Weakly simflar to III! ALU SUBFA 

109514 AA234087 Hs.262346 ESTs; Weakly similar to ORF2: function 

109572 F02027 Hs.171937 ESTs 

109632 F04165 Hs.235873 ESTs; Weakiy similar to K11C4 2 [C.eleg 

30 109644 F04477 Hs.291531 ESTs; Moderately similar to GLYCERAL 

109703 F09684 Hs.24792 ESTs; Weakly similar to ORF YOR283w 

109726 F10009 Hs.9196 ESTs 

103747 F10161 Hs.22969 ESTs 

109799 F10770 Hs.180378 Homo sapiens done 669 unknown mRNA 

35 109814 F10979 Hs.153106 Homo sapiens done 23728 mRNA sequen 

110189 H20543 Hs.6278 OKFZP586B1621 protein 

110240 H25577 Hs.176588 ESTs; Weakly similar to CYTOCHROME 

110280 H29285 Hs.32468 ESTs 

110520 H56965 Hs.4082 yr09f06.s1 Soares fetal fiver spleen 1NFL 

40 110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CO 

110707 H95079 Hs.15617 ESTs; Weakly similarto HI! ALU SUBFA 

110734 H98714 Hs.24131 ESTs 

110770 N22262 Hs.131705 ESTs 

110780 N23174 Hs.22891 solute carrier family 7 (cationlc amino aci 

45 110787 N24716 Hs.12244 ESTs; Weakly similarto C44B9.1 [C.eleg 

110794 N25262 Hs.27931 ESTs 

110799 N26101 Hs.323401 Human ring zinc-finger protein (ZNF127- 

110818 N29454 Hs.27552 ESTs; Weakly similar to putative p150 [H 

110839 N30856 Hs.30246 solute carrier family 19 {thiamine transpo 

50 110844 N31952 Hs.167531 Homo sapiens mRNA fuD length insert cD 

110854 N32919 Hs.27931 ESTs 

110856 N33063 ESTs; Weakly similar to S164 {H^apiens 

110860 N33438 Hs.170065 ESTs 

110897 N39148 Hs.6880 DKFZP434D 156 protein 

55 * 110915 N46252 Hs.29724 ESTs 

110935 H48787 Hs.305979 protease inhibitor 1 (anli-elastase); alpha- 

110970 N51374 Hs.96870 Homo sapiens mRNA full length insert cD 

111006 N53375 Hs.166146 Homer; neuronal immediate early gene; 3 

111008 N53388 Hs.7222 ESTs 
60 111018 N54067 Hs.3628 m'rtogen^activated protein kinase kinase ki 

111084 N59543 Hs. 15456 PD2 domain containing 1 

111100 N62522 Hs.20450 ESTs 

111125 N63823 Hs.269115 ESTs 

111132 N64378 Hs.83293 ESTs; Highly similar to unknown function 
65 111139 N64683 Hs.290943 ESTs 

111164 N66857 Hs.14808 ESTs; Weakly similar to 111! ALU CLASS 
111172 N67102 Hs.21851 Homo sapiens mRNA; cDNA OKFZp5 86 
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111178 N67227 


Hs.24633 


ESTs 


5.7 


111179 N67239 


Hs 10760 


ESTs 


37 


111181 N67278 


Hs. 171802 

1 W. I* 1 Wfc 


ESTs' WeaJdv similar to hvoofhetica] nro 


5.6 


111184 N67437 


Hs.243901 


Homo sapiens mRNA; cDNA DKFZp564 


8.7 


111221 N68869 


Hs 15119 

I I0< IJ 1 IV 


ESTs 


7.3 


111223 N68921 


Hs 297939 


FST*;* W/paklv sfmifar In npnnpnin W san 

Luis, ricoMj ouiiuai iu iicuyouui i.gap 


9 


111229 N69113 


He 1 10855 

(Id* 1 IVOwv 


ESTs 


8.9 


111241 N69514 


H&288880 


ESTs; Weakly similar to CGl-82 protein ( 


6.9 


111268 N70481 

III lUU Vil VTV 1 


Hs 26118 


Hnmn <^an!pn<; rionp 24766 mRMA ^poiipn 


4.5 


111295 N73275 


Hs.21275 


FSTs* Wpakh/ similar tn uhlnnitin-onnliin 


5.6 


111299 N73808 


Hs.24936 


ESTs 


8.5 


111336 N79565 


Hs 29694 


ESTs 


6.7 


111357 W91023 


Hs 87128 

f 13* wf \£SJ 


ESTs 


15 


111370 N92915 


Hs.94631 


brefetdin A-inhibited guanine nudeotide-e 


5.2 


111806 R33468 

111 Uvv lAwsTtW 


Hs 279008 

1 IhJ<4.4 WW WW 


ESTs 


10 


111825 R35885 


Hs.286148 


stromal antigen 1 


4.5 


111836 R36228 


Hs.25119 


ESTs 


7.2 


111890 R38678 


Hs 12365 


ESTs 


17.3 


111923 R39995 


Hs.25925 


Homo sapiens done 23860 mRNA sequen 


7.3 


111942 R40576 


Hs 21590 

f 1 WWW 


ESTs 


9.2 


111987 R42038 


Hs 6763 


KIAA0942 oratein 


10,6 


112101 R44793 

1 14. IV 1 1 WW 


Hs.296341 


adenytyt cyclase-associated protein 2 


5.3 


112134 R46025 


Hs.7413 


ESTs 


17,4 


112197 R49482 


Hs.5637 


ESTs 


4.4 


112244 R51309 


Hs 70823 


KIAA1077 orotein 


11 


112253 R51818 




Homo saoiens mRNA* cDNA DKFZd566 


9.3 


112305 R54822 


Hs.26244 


ESTs 


4.4 


112449 R63802 


Hs 124186 


rirvi finnfir nrnfpfn 2 

Miiy iiiivjQi ptvjicui £> 


6.3 


112483 R66534 


Hs.285885 


ESTs 


4.9 


112519 R68631 


Hs.11861 


ESTs 


14.3 


112610 R79392 


Hs 23643 


ESTs 


5.2 


112693 R88741 


Hs 91065 


ESTs* Mnrifiratp.lv similar to DroQfsration 


4.6 


112751 R93507 


Hs.8207 


ESTs 


5.6 


112801 R97486 


Hs.157160 


protein kinase; DMA-activated; catalytic p 


8.7 


112869 T03313 


Hs.4747 


dyskeratosis congenita 1; dyskerin 


5.9 


112A71 TQ3352 


Hs 12285 


ESTs 


5.8 


112908 T100S5 


Hs3530 


TLS-assoeiated serlne-arainlne Drotein 


4^1 


112966 T17119 

1 It JUU 1 1 1 1 1 J 


Hs 102548 


Glucocorticoid receotor DNA bindina fact 


5.7 


112971 T17185 


Hs.83883 


ESTs 


64 


112995 T23528 


Hs.7155 


ESTs; WeaWy similar to TYW protein (M 


9.1 


113047 T25867 


Hs.7549 


ESTs' 


5.4 


113075 T34660 


Hs.6986 


ESTs; Weakly similar to iill ALU SUBFA 


5.7 


113117 T47819 

IIJIII l~» V lv 


Hs 159153 

1 Iw* Ivv I WW 


ESTs' 


5.8 


113206 T58044 


Hs 241471 


ESTs- Mnderatplv similar hi till ALU SU 


6.4 


113248 T63857 




vc16e1 s1 Strataaene lima (#93721) Homo 

jw iw 1 1« i wuQiay wi iq mi iy yrww iwiiiw 


4.6 


111260 T64AQ6 


Hs 287420 


ESTs 


6.9 


113277 T65797 


Hs.11774 


orntem fnentidvl-nrolvl cisTtrans isomaras _ ~ 


5^6 


11197ft T65ftf)2 

1 1 Jt/ Q 1 0 JOUi 


H«s 1113*5 


vr11hin q1 Straianpnp him ^37210) Hn 


4.5 


113440 T86121 


Hs.191445 


ESTs 


6.4 


113523 T90037 


Hs.95549 


ESTs 


6.4 


113604 T92735 


Hs.296083 


ESTs 


8.7 


113702 T97307 




ESTs; Moderately simflar to !!!! ALU SU 


9.5 


113783 W19222 


Hs.7041 


ESTs; WeaWy similar to till ALU SUBFA 


5.2 


113794 W37382 


Hs.11090 


ESTs 


11.9 


113808 W44735 


Hs.9286 


ESTs 


16.7 


113811 W44928 


Hs.6994 


ESTs 


4 


113822 W47350 


Hs.17466 


retinolc acid receptor resportder (tazaroten 


4.8 


113823 W47388 


Hs.55099 


rab6 GTPase activating protein (GAP and 


4 


113836 W56792 


Hs.12040 


ESTs; Weakly similar to KIAA0881 prate 


4.1 


113857 W65477 


Hs.5297 


Homo sapiens mRNA; cDNA DKFZp564 


4.3 


113886 W72471 


Hs.23920 


ESTs 


4.6 


113895 W73738 


Hs.12921 


ESTs 


7.1 


113923 W80763 


Hs.3849 


ESTs; Weakly similar to FK506-binding p 


6.8 


113931 W81205 


Hs.3496 


ESTs 


6.1 


113950 W85765 


Hs.30504 


Homo sapiens mRNA; cONA DKF2p434 


14 


113970 W86748 


Hs.8109 


ESTs 


15 


114051 W94942 


Hs.177534 


dual specificity phosphatase 10 


5.4 
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114057 W96222 
114086 Z3B266 
114098 Z38347 
114109 Z38435 
114124 Z38595 
114138 Z38763 
114149 Z38814 
114162 Z38909 
114177 Z39062 
114196 Z39211 
114208 Z39301 

114250 Z39897 

114251 239898 
114292 Z40715 
114297 Z40758 
114334 Z41342 
114460 M024604 
114471 AA028074 
114480 AA032243 
114518 AA046407 
114542 AA055768 
114549 AA056484 
114652 M101416 
114673 AA113303 
114698 AA126951 
114767 AA148885 
114799 AA159323 
114804 AA160363 
114811 M161161 
114821 AA165313 
114852 AA235035 

114901 AA236276 

114902 AA236359 
114940 AA243012 
114965 AA250737 
115047 AA252627 
115054 AA252863 
115061 AA253217 
115082 AA255557 
115116 AA256486 
115140 AA258030 

115205 AA262470 

115206 AA262491 
115239 AA278650 
115242 AA278755 
115249 AA278961 
115259 AA279071 
115285 AA279799 
115291 AA279943 
115357 AA281793 
115377 AA282247 
115400 AA283198 
115439 AA284561 
115471 AA287138 
115506 AA292537 
115522 AA331393 
115572 AA398392 
115587 AA399264 
115600 AA400247 
115612 AA400948 
115646 AA404352 
115652 AA405098 

115657 AA405620 

115658 M405625 
115675 AA406546 
115721 M417102 
115763 AA421560 



Hs.34192 
Hs.288649 
Hs.118338 
Hs.184108 
Hs.125019 
Hs.15740 
Hs.27196 
Hs.22265 
Hs.23740 
Hs.150926 
HsJ859 
Hs.13297 
Hs.21948 
Hs.184641 
Hs.173091 
Hs.22941 
Hs.26102 
Hs.104613 
Hs.151678 
Hs.106469 
Hs.293380 
Hs.292833 
Hs.107149 
Hs.95583 
Hs.1 10857 
Hs.154443 
Hs.109929 
Hs.269956 
Hs.95907 
Hs.55468 
Hs.38260 
Hs.196437 
Hs.39504 
Hs.75928 
Hs.72472 
Hs.82916 
Hs.87729 
Hs.41271 
Hs.198269 
Hs.62275 
Hs.279938 
Hs.284216 
Hs.186572 
Hs.73291 
Hs.283732 
Hs.71124 
Hs.13453 
Hs.293736 
Hs.122579 
Hs.72988 
Hs.193063 
Hs.89113 
Hs.193090 
Hs.59346 
Hs,45207 
Hs.47378 
Hs.59594 
Hs.283037 
Hs.42173 
Hs.71243 
Hs.305971 
Hs.38178 
Hs.55158 
Hs.183056 
Hs.82065 
Hs.90960 



ESTs 

Homo sapiens PAC clone DJ0777O23 fro 

ESTs; Weakly similar to similar to S, cere 

ribosomal protein L21 

ESTs; Highly similar to KIAA0886 prate 

amyloid beta (A4) precursor protein-bindi 

ESTs 

ESTs 

ESTs 

fucose-1-phosphate guanylyltransferase 

ESTs 

ESTs 

ESTs 

delta-6 fatty acid desaturase 

OKFZP434K151 protein 

ESTs 

ESTs 

ESTs 

UOP-N-acetyl-alpha-C>galactosamine:po 
suppressor of varl {S.cerevisiae) 3-iike 1 
ESTs 
ESTs 

ESTs; Weakly similar to PTB^SSOCIAT 
transmembrane 4 superfamHy member (te 
ESTs; Highly similarto putative DNA-dlr 
minichromosome maintenance deficient (S 
ESTs 
ESTs 

multiple inositol polyphosphate phosphata 
ESTs 

ESTs; Moderately similar to ubiquitin spe 

ESTs; WeaWy similar to R26660J; parti 

ESTs 

ESTs 

ESTs 

homeo box B5 

ESTs 

ESTs 

NADH dehydrogenase (ubiquinone) 1 alp 
ESTs 

ESTs; WeaJdy similar to supported by GE 

ESTs 

ESTs 

ESTs; Weakly similar to similarto the bet 

ESTs 

ESTs 

splicing factor 3b; subunit 1 ; 155kD 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to ASPARTYL-TR 
Human DNA sequence from clone 620E1 
ESTs 

ESTs; WeaWy similar to F33G12.3 gene p 
ESTs; Highly similar to HSPC039 protein 
ESTs 

ESTs; WeaWy similar to zinc finger prote 

ESTs 

ESTs 

ESTs; Weakly similar to weak similarity t 
Human DNA sequence from done 34B21 
Homo sapiens mRNA; cONA OKFZp564 
ESTs 
ESTs 
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115764 AA421562 Hs.91011 


anterior gradient 2 {Xenepus taevis) homo 


41.6 


115835 AA428576 Hs.41371 


ESTs 


4.2 


115844 AA430124 Hs.7773 


ESTs 


11.9 


115875 AA433943 Hs.43946 


ESTs; Weakly similar to Weak similarity 


33.5 


115888 M435839 Hs.76591 


KIAA0887 protein 


7.2 


115922 AA441911 Hs.71869 


ESTs; Weakly similar to KIAA0926 prote 


5.1 


115941 AA443602 Hs.46679 


ESTs 


4.8 


115947 AA443793 Hs.94761 


ESTs 


8.3 


115948 AA443798 Hs.43445 


poly(A)-spedfIc ribonudease (deadenyiat 


13.5 


115951 AA443918 Hs.301048 


cofilin 1 (non-muscle) 


7.5 


115967 AA446887 Hs.42911 


ESTs 


8.8 


115984 AA447687 Hs.91109 


ESTs 


13.1 


116009 AA449448 Hs.44238 


ESTs 


5.5 


116024 M451748 Hs.83883 


Human DNA sequence from done 718J7 


7.5 


116028 AA452112 Hs.42644 


thloredoxin-Uke 


12.7 


116050 AA453656 Hs.88417 


ESTs 


7.2 


116097 AA456099 Hs.176376 


ESTs 


11.8 


116108 AA457566 Hs.28777 


ESTs 


4.5 


116121 AA459254 Hs.48855 


ESTs 


4.5 


116127 AA459703 Hs.279884 


v-myc avian myelocytomatosls viral onco 


4.3 


116129 AA459956 Hs.49163 


ESTs; Highly similar to putative rlbonude 


7.6 


116142 AA460649 Hs.39457 


ESTs 


4.8 


116204 AA465701 Hs.108646 


ESTs 


6,8 


116221 AA478397 H$.50180 


ESTs 


4.9 


116222 AA478415 Hs.89986 


ESTs 


4 


116238 AA479362 Hs.47144 


DKFZP586N0819 protein 


4.6 


116246 AA479961 Hs.250646 


ESTs; Highly similar to ubiquituvconjuga 


4 


116249 AA480886 Hs.86693 


ESTs 


18.5 


116250 AA480975 Hs.44829 


ESTs 


10.8 


116254 M481146 Hs.41086 


ESTs; Weakly similar to OXYSTEROL-B 


9.1 


116256 AA481256 Hs.88201 


ESTs; Weakly similar to lysophosphoBpa 


8.4 


116264 AA482594 Hs.272239 


Homo sapiens mRNA; cDNA DKFZp586 


12 


116265 AA482595 Hs.55189 


ESTs; Weakly similar to F25B5.3 [C.eleg 


11.1 


116282 AA486550 Hs.204501 


ESTs; Weakly similar to Wiskott-Aldrich 


6.2 


116298 AA489046 Hs.94109 


ESTs 


4.9 


116300 AA489194 Hs.159471 


ESTs; Weakly similar to snRNP protein B * 


4.6 


116327 AA490959 Hs.28005 


Homo sapiens mRNA; cDNA DKFZp564 


5.8 


116334 AA491457 Hs.48946 


ESTs 


4.3 


116337 AA496127 Hs.44070 


ESTs 


8.4 


116351 AA504116 Hs.82501 


Homo sapiens mRNA; cDHA DKFZp434 


5.3 


116357 AA504806 Hs.90797 


Homo sapiens clone 23620 mRNA sequen 


5.2 


116415 AA609204 Hs.27973 


KIAA0874 protein 


6.6 


116443 AA620313 Hs.190488 


ESTs; Weakly similar to KERATIN; TYP 


4.5 


116470 C13992 Hs.83484 


ESTs 


4.5 


116480 C14088 


giyceraldehyde-3-phosphate dehydrogena 


5.6 


116578 D51272 Hs.75337 


nucleolar phosphoprotein p130 


4.1 


116579 D51276 Hs.81915 


leukemia-associated phosphoprotein p18 


5.8 


116626 F02028 Hs.81907 


ESTs 


4.9 


116647 F03069 Hs.15395 


ESTs; Weakly similar to ARGINYL-TRN 


6.1 


116674 F04816 Hs.92127 


ESTs 


10.6 


116680 F08813 Hs.273829 


UNE retrotransposabte element 1 


.4.2 


116700 F09983 Hs.317589 


ESTs 


13 


116724 F13665 Hs.65641 


ESTs 


8.5 


116726 F13681 Hs.53913 


ESTs 


5.6 


116732 F13779 Hs.165909 


ESTs 


11.6 


116734 F13789 Hs.93796 


DKFZP586D2223 protein 


5.4 


116760 H11054 Hs. 155342 


protein kinase C; delta 


4.3 


116780 H22566 Hs,30098 


ESTs 


5.7 


116786 H25836 Hs.301527 


tumor necrosis factor (Hgand) superfamfly 


8.8 


116787 H28581 Hs.15641 


ESTs 


8.6 


116790 H29532 Hs.101174 


mtcrotubuie-assodatfid protein tau 


22.2 


116803 H47357 Hs.109701 


ESTs; Moderately similar to weak similar 


6.7 


116877 H68116 Hs.168732 


ESTs 


6.5 


116921 H72948 Hs.821 


blglycan 


20.7 


117216 N20083 Hs.42792 


ESTs 


4.4 


117232 N20579 Hs.61153 


ESTs 


7.4 


117284 N22162 Hs.183779 


ESTs; Weakly similar to cONA EST yk33 


4.1 
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117344 N24046 


Hs.210706 


ESTs 


117367 N24954 


Hs.42502 


ESTs 


117392 N26175 


Hs.93405 


ESTs 


117394 N26257 


Hs.39871 


KIAA0727 protein 


117412 N26722 


Hs.42645 


ESTs 


117498 N31726 


Hs.44268 


ESTs; Highly similar to myelin gene expr 


117557 N33920 


Hs.44532 


diubiquitin 


117634 IM36421 


Hs.13323 


ESTs; Weakly similar to SODIUM- AN0 


117639 N36923 


Hs.44833 


ESTs 


i 1/(34 vv\i *tu9 


Hs 59757 


ESTs 


117852 N49408 


Hs.136102 


KIAA0853 protein 


11787Q NSflD'iO 


Hs.303025 


ESTs; Weakly similar to keratin; 67K typ 


417Q94 KKIflRR 
11/5^1 PJOIVOO 


Lie 'XRO.Qi 


ESTs 


117950 N51394 


Hs.75478 


KIAA0956 protein 


1179Q9 M59000 


Hs 172089 


Homo sapiens mRNA; cDNA DKFZp586 


1 101 JO lw/ i 


M<? 93560 


ESTs; WeaWy similar to tig [R.norvegicu 


11A915 


Hs 77910 


34iydroxy-3-mefliylglLitary^Coen^me A 


11A99Q NR933Q 


Hs 166254 


heat shock 90kO protein 1; alpha 


11fi?fi5 N62827 

1 I0&Q3 iMu£U£f 


Hs 48645 


EST 


1 10000 mOOOUH 


H* 471fifi 


ESTs 


118363 N64168 


Hs.48938 


ESTs 


11R49Q Nfifi15fl 


Hs 74649 


ESTs 


118470 N66769 


Hs.291033 


ESTs 


11A479 NfifiRIA 

1 104// JTOOO 10 


Hs.42179 


ESTs 


118475 N68845 




ESTs; Weakly similar to IU! ALU CLASS 


1 1AAQ^ NR714Q 
110490 NO/ WSf 




ESTs 


11RS9R NG7RRQ 




ESTs 


118542 NG8010 


Hs 49427 


ESTs 


Hftfinn NfLQ??2 




ESTs 


11ARQ5 N71781 


Hs 50081 


Homo sapiens mRNA full length insert cD 


HARQfl MT7in 

i looyo m&iio 


H<! 5(1187 
no.uu to/ 


ESTs 




Hs.94445 


ESTs 


«1flQ5? "WJi 




ESTs; Highly simQar to GGI-90 protein {H 


11RQ7R NQ3R9Q 


u« 93391 


ESTs 




Hs 195830 


ESTs 




Hs 45105 


ESTs 


11Qn97 MQQ25R 

1 \Z)\)C( IH93£30 


Hs 114611 


ESTs 


119042 R05316 


Hs.5472 


ESTs 


1 ion7«; r'v^i 

1 19U/3 r\00Hg 1 


9A7fi9(l 


fihrnnprfin 1 


HOOKA T1RQ1R 
119Z0U 113910 


1-k 109950 
ns». luzijjvj 


PSTs* Hiahlv similar to coat Drotein Qamrn 




Hs.65328 


ESTs 


1 19190 1 COQiM 


Hs 155478 


cyciin T2 


I I90UZ leal to 




ESTs 


1 1Q1d1 TR9T71 
1 7904 1 1 0ZOY 1 




microtubule-associated protein 7 


119495 W35390 


Hs.55533 


ESTs 


119580 W42451 


Hs.92260 


high-mobility group protein 2-like 1 


119602 W46286 


Hs.233694 


ESTs; Weakly similar to ZK1058.5 {Cele 


119620 W47620 


Hs.56009 


y-S'ofigoadenytate synthetase 3 


119676 W60473 


Hs.57787 


ESTs 


119717 W69134 


Hs.57987 


ESTs 


119729 W69747 


Hs.94806 


K1AA1062 protein 


119805 W73788 


Hs.43213 


ESTs 


119859 W80702 


Hs.58461 


ESTs 


119867 W80852 


Hs.250696 


KDEL (Lys-Asp-Gtu-Leu) endoplasmic re 


119673 W81129 


Hs.44865 


Homo sapiens mRNA; cDMA DKFZp586 


119899 W84767 


Hs.58698 


ESTs 


119940 W86779 


Hs.272531 


DKFZP586B0319 protein 


119943 W86835 


Hs.14158 


copine 111 


119970 W87812 


Hs.93581 


Homo sapiens mRNA; cDNA DKFZp586 


120131 Z38656 


Hs.75887 


coatomer protein complex; subunit alpha 


120150 Z39549 


Hs.153746 


ESTs 


120206 Z40805 


Hs.91668 


ESTs 


120241 Z41815 


Hs.65946 


ESTs 


120255 AA169752 Hs.5672 


ESTs; WeaWy similar to Similarity to Yea 


120314 AA194166 Hs.221040 


KIAA1038 protein 


120325 AA195651 Hs.104106 


ESTs 


120352 AA211400 Hs.193172 


ESTs 
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120428 AA236822 Hs.173694 


K1AA1097 Dioteln 


5.6 


120524 AA261852 Hs 192905 


ESTs 


5.6 


120528 AA262107 Hs 104413 


ESTs 


4.5 


120571 AA280738 Hs 34892 


ESTs 


4.9 


190840 AA9ft711'i He 1Q9A43 


ESTs 


4.5 


120855 AA9R7347 He 9^8905 


ESTs 


6.7 


190Rfift AA9R7R33 He909Q13 


ESTs 


ft 3 
o.o 


120719 AA9Q9R54 He Ifl^Rnfi 
Itvlf It /Vu94V9t no. lutouo 


piikarvntir traneln(!rtn fnitlatinn farinr 9 a\ 

CUlVdiyUUU UalloldUUII UllUaUUJI laUUJI t ai 


4.6 


190711 AA2Q9R55 ^08557 


ESTs 


10.6 


190794 AA9Q3470 He 100747 


PSTs 




120873 AA358015 


EST 


7 i 
/ . 1 


120885 AA365515 Hs 301872 


ESTs: Moderately similar to f((! ALU SU 


4.6 


190010 AA3ft1195 He 301444 


F<5Te 


R9 
o.t 


19004ft AA3Q7A99 He 10AR50 


CQTc Hinhtv similar tn eimltar fn mann n 

no i b, niymy oiiniidi iu sinuiai 10 nsdyu n 


ft R 
o.o 


190080 AA30R11R He 19090R 

ItUsJCKJ MMOoO I ID rio. ItStUu 




10 *i 
IU.O 


190Q77 AA"Wft155 HcQ7ftA0 


CO 15 


10 Q 


191 105 AAlQRQ^fi He Q7RQ7 


CO 1 


7 4 


191901 AA4017R3 HcftlAR 
itityt MAiUi/oo ns.oiou 


UJliy LalHAJI UalBjlUdlc 


^ 3 
3.0 


191390 AA4030flfl He 301097 


Tj*pI1 rRnpnlnr ntnha A/'D* I'fl^ 


13.5 


1914R1 4A4117A5 He 91QRR1 


PQTe' Wpalflu eimliar tn klAAO^RA nmfa 

co 1 5, vveawy simudi 10 rxi/Anuo^ proio 


ft Q 


191 SOB AA41R740 He174104 


ESTs 


22.6 


1 2 1 723 AA4 1 9822 Hs 1 04800 

1 1 1 f *-«J rV^^ 1 OV/tt 1 IO. 1 UHyUU 


ESTs* Weaklv similar to Mouse 19 5 mRN 


Q 


19174A AA491171 He93454 l > 


ESTs 


5.6 


19919*i AA43AA11 H«? QSROfi 


ESTs 


5.3 


199599 AA449444 HeQBQRO 


FSTs 

cot a 


4 


199RR5 AAi1d7^fi Hc07fl*l7 


Colo 


A 
H 


122704 AM56326 Hs 99445 


ESTs 


6.2 


199789 AA4^QftQ4 He 00479 

Itt/Ot /VVfdlJO;f*l no.J73H/ t 


COI o 


5.3 


199ft5fi AA483740 He753R7 


Ol Lr1ll\c*aUapvci 


13 1 

Is). 1 


122882 AA465381 Hs 108812 


ESTs* Weaklv similar to B0041 5 IC etea 


5.5 


122998 AA47S578 Hs 101840 


ESTs 


6*3 


122974 AA478625 H«; 194215 


• ESTs 


6 


122997 AA479295 Hs 106290 


Kplch motif containinn nmtpin 


12.5 


123016 AA480103 Hs 323931 


FSTe* Wftaktv eimflar to altpjTrwfiuAlv enfi 


4.4 


123107 AA486071 Hs.104207 


ESTs 


8.3 


123111 AA486273 Hs.191721 


ESTs 


4.2 


123114 AA486407 Hs. 129928 


F^Te* Mnffpratph/ eimilar tn KIAA041d r» 


5.2 


123136 AA487449 Hs.194024 


ESTs 


4.2 


193137 AA4ft74fift He 100R8R 


F^Te* lA/paHu eimilar fn eprrpfpH rpmonf 


14.6 


123189 AA488892 


FSTs* WpaJctv eimn^r to RafuPnf nofvnm 

WIO, ¥IC«UMj OINIiMU IU WwyrVI ^MiJUIW 


4.5 


19T17R AA4AQ090 HeRQ933 


ESTs 


52 


19333ft b&WAJAQ He 1R75R5 


ESTs 


4 


193438 AA50A714 He 993014 


nmtpaeo* eprinp' 1R 


7.3 


191449 AA50ftA03 He1114QR 
1 tO*r»t rvvjoOOUO no. 1 1 1 Hw 


ESTs 




19144Q AA^QRRQO He 1194Q3 




4 1 


1914Q4 AAR0Q7RR He 119110 
itOWt M/wjj/00 nS.lltllU 


CCTc 
CO 19 


4 


ItoJUO AnOUUitl nS.tydljD 


COI s 


19 ft 
lt.0 


19T;i1 AARAft751 


F3Te- WpalHu eimilar tn (Ml AH 1 31 tRFA 
COI o, VVBdWy olilular (0 1UI rU-U OUOrA 


7 0 


123619 AA609200 


ESTs 


23.1 


123673 AA609471 Hs.158549 


ESTs 


6.6 


123729 AA609778 Hs^78672 


membrane component; chromosome 11; s 


4.7 


123819 AA620636 Hs.1 12264 


ESTs 


4 


123960 AA621785 Hs.287733 


methytmaionate-semialdeh/de dehydroge 


7.6 


124000 D57317 Hs.74861 


activated RNA polymerase II transcription 


4.4 


124006 D60302 Hs.270016 


ESTs 


20.6 


124012 080240 Hs.241471 


HUM5G 1 1 A Human fetal brain (TFujiwa 


6.7 


124021 F02859 Hs.13974 


ESTs 


4.7 


124049 F10523 Hs.74519 


primase; polypeptide 2A (58kD) 


4.7 


124059 F13673 Hs^83713 


ESTs 


7,7 


124243 H66710 Hs.133525 


ESTs 


5.5 


124308 H93575 Hs.241507 


Homo sapiens mRN A; cONA OKFZp564 


11.4 


124314 H94877 . Hs.215766 


GTP-binding protein 


13.7 


124315 H94892 Hs.288757 


v-ral simian leukemia viral oncogene horn 


14 


124350 N21359 Hs.101282 


Homo sapiens mRNA; cONA DKFZp434 


8.6 


124352 N21626 Hs.102406 


ESTs 


7.2 


124357 N22401 


yw37g07.s1 Morton Fetal Cochlea Homo 


5.2 


124390 N29325 Hs.7535 


ESTs; Highly similar to COBVWike place 


7.9 
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124438 N40186 Hs.11090 ESTs 

124447 N480GO Homo sapiens mRNA; cDNA DKFZp586 

124457 N50114 Hs266175 ESTs 

124539 N63172 Hs.146409 eel! division cycle 42 (GTP-binding prate 

5 124626 N74604 Hs.11090 ESTs 

124632 N79515 Hs.306117 interieukin 13 receptor; alpha 1 

124644 N91279 Hs.109654 ESTs; Moderately similar to outer membr 

124676 R01037 Hs.181013 phosphoglycerate mulase 1 (brain) 

124677 R01073 ESTs; Weakly similar to 111! ALU CLASS 
10 124724 R12405 Hs.1 12423 Homo sapiens mRNA; cDNA DKFZp586 

124773 R40923 Hs.106604 ESTs 

124777 R41933 ESTs 

124792 R44357 Hs.48712 ESTs; Weakly similar to cONA EST EMB 

124857 R53652 Hs.137190 ESTs 

15 124911 R88992 Hs.180612 ESTs 

124955 T10598 Hs.324841 ESTs; Weakly similar to ill! ALU SUBFA 

124958 T11134 Hs.431 murine leukemia viral (bmM) oncogene h 

125038 T78089 Hs.270134 ESTs 

125092 T92544 Hs.137548 CD84 antigen {leukocyte antigen) 

20 125132 W15495 Hs.129781 chromosome 21 open reading frame 5 

125144 W37999 Hs.24336 ESTs 

125154 W38419 ESTs 

125243 W86423 Hs.105413 ESTs 

125279 W93540 Hs.4779 ESTs; Moderately similar to similar to AO 

25 125299 Z39436 Hs.102720 ESTs 

125303 Z39821 Hs.288193 ESTs 

125304 Z39833 Hs.124940 GTP-binding protein 

125474 AA151216 Hs.75103 tyrosine 3<monooxygenase/uyptophan 5-m 

125509 AA044232 Hs.288967 ESTs 

30 125580 AA126504 Hs.267812 sorting nexin 4 

125582 AA507383 Hs.74649 cytochrome c oxidase subunit Vic 

125670 A1432621 Hs.82685 CD47 antigen (Rrwelated antigen; integri 

125698 AA748483 Hs.191356 general transcription factor IIH; polypepti 

125745 A1283493 Hs.75722 ribophorinll 

35 125852 H09290 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564 

125972 AA434562 Hs.35406 ESTs 

126160 N90960 Hs.265398 ESTs; Weakly slmflar to transformaOon-r 

126257 N99638 Hs.124084 tumor necrosis factor receptor superfamily 

126337 AI066486 Hs.40500 similar to S. cerevbiae RE R1 

40 126405 U46278 Hs.122489 ESTs 

126537 W40262 Hs.146310 ESTs; Weakly similar to putative p1 50 [H 

126590 W78968 Hs.181307 H3 histone; family 3A 

126712 AA205862 Hs.7942 ESTs 

126721 T72569 Hs.125359 Thy-1 cell surface antigen 

45 126764 AI334393 Hs.102178 ESTs 
126804 AI203334 Hs.160628 ESTs 
126819 AA305536 Hs279607 ESTs 
126877 A1052047 Hs.26102 ESTs 
126991 R31652 Hs.821 biglycan 

50 127479 AA513722 Hs.179729 collagen; type X; alpha 1 (Schmid melaph 
127514 AA826926 Hs.204214 ESTs 

127663 W07286 Hs.10340 ESTs; Weakly similar to weak similarity t 
127677 AA916752 Hs.264190 ESTs; Highly similar to MEM3 (M.muscu 
127814 AA761755 ■ Hs.136713 ESTs; Weakly similar to V4-1 {Rsapiens 
55 127997 AI281549 Hs.311054 ESTs 
128092 AA904617 Hs.166229 ESTs 

128218 H02682 Hs.292154 ESTs; Moderately similar to recombinatio 
128466 059653 Hs.241471 EST 
128482 U83908 Hs.296251 programmed cell death 4 
60 128517 AA280617 Hs.100861 ESTs; Weakly similar to p60 katanin [H.s 
128530 AA504343 Hs.183475 Homo sapiens clone 25061 mRNA sequen 
128559 AA226801 Hs.101448 metastasis associated 1 
128574 AA412048 Hs.38260 keratin 8 

128595 U31875 Hs.152677 short-chain alcohol dehydrogenase family 
65 128610 L38608 Hs.10247 activated leucocyte cell adhesion molecule 
128629 AA399187 Hs.102708 DKF2P434A043 protein 
128649 AA142853 Hs.103106 Homo sapiens mRNA for G7b protein (G 



9.5 

4.8 

6.1 

5.6 

12.8 

6.4 

8.3 

12.3 

5.4 

6.6 

4.9 

72 

8.6 

4.9 

4.7 

4.4 

12.6 

4.1 

14.8 

6.7 

4.8 

5.3 

6.6 

5.8 

12.2 

10.2 

6.8 

8 

5.4 
4.1 
11.5 
4 

9.4 " 

6.2 

25.9 

4.1 

16.4 

9.5 

5.6 

7.5 

4.1 

4.5 

5.2 

4.4 

4.6 

11.7 

4 

7 

5.6 

14.3 

4.5 

5.1 

17.3 

4.1 

5.5 

5.8 

5.8 

7.4 

5.8 

8.3 

6.6 

5.2 

5.1 

27.1 
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6.7 
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128651 


AA446990 


Hs.103135 


ESTs 


6.1 


128553 


R48943 


Hs.10315 


solute carrier family 7 (cationic amino ad 


4.4 


128656 


AA458542 


Hs.10326 


coatomer protein complex; subunit epsilon 


14.3 


128717 


T30617 


Hs.104222 


Homo sapiens mRNA; cDNA DKFZp566 


24.5 


128727 


M64174 


Hs.50651 


Janus kinase 1 (a protein tyrosine kinase) 


7.3 


128764 


N49308 


Hs.104938 


ESTs; Weakly similar to alpha 1(XVI1I) c 


9.2 


128781 


X85372 


Hs.105465 


small nuclear ribonucleoproteln pofypepti 


5.4 


128793 


W93562 


Hs.105749 


KIAA0553 protein 


4.6 


128835 


W15528 


Hs.106390 


Homo sapiens mRNA; cDNA DKFZp586 


4 


128845 


AA455658 


Hs.10649 


basement membrane-induced gene 


6.9 


128871 


AA400271 


Hs.106778 


Homo sapiens mRNA for putative Ca2*-t 


4.5 


128922 


AA252023 


Hs.9589 


ESTs; WeaWy similar to HRIHFB2157 (H 


6.4 


128925 


D61676 


Hs.21851 


Homo sapiens mRNA; cDNA DKFZp586 


6.4 


128938 


AA410325 


Hs.107260 


ESTs 


7 


128946 


N29353 


Hs.107318 


kynurenine 3-monooxygenase (kynurenln 


5.2 


128948 


AA485655 


Hs.223025 


proteasome (prosome; macropain) subunit 


13.1 


128955 


F10290 


Hs.185807 


Homo sapiens done 24758 mRNA sequen 


5.8 


129005 


AA460049 


Hs.13323 


ESTs; Weakly similar to SODIUM- AND 


12.6 


129009 


AA131421 


Hs.75607 


ESTs 


9.8 


129017 


H131Q8 


Hs.107968 


ESTs 


13.9 


129057 


X62466 


Hs.276770 


CDW52 antigen (CAMPATH-1 antigen) 


10.7 


129075 


AA129465 


Hs.83765 


ESTs 


4.7 


129095 


L12350 


Hs.108623 


thrombospondin 2 


4.4 


129124 


AA234530 


Hs.1088O2 


N-ethylnratetmide-sensifive factor 


20.7 


129160 


AA131252 


Hs.109007 


ESTs 


5.9 


129164 


AA282183 


Hs.109045 


ESTs 


5.8 


129180 


R40556 


Hs.318401 


ESTs; Highly similar to HSPC039 protein 


7.6 


129224 


X89109 


Hs.109606 


coronin; actin-binding protein; 1A 


12 


129229 


AA211941 


Hs.109643 


polyadenylate binding protein-interacting 


7.9 


129240 


W24360 


Hs.237868 


interieukin 7 receptor 


5.3 


129241 


AA435665 


Hs.109706 


ESTs; Moderately similar to HN1 (M.mus 


8.4 


129243 


H88033 


Hs.109727 


KIAA0733 protein 


7.8 


129247 


AA151574 


Hs.109733 


pilin-fike transcription factor 


6.4 


129259 


AA090695 


Hs.181385 


ESTs 


6.2 


129270 


Z35227 


Hs.109918 


ras homotog gene family; member H 


5.4 


129281 


AA026318 


Hs.289101 


glucose regulated protein; 58kD 


4.4 


129300 


C20976 


Hs.1 10165 


ESTs; Highly similar to ribosomal protein 


5.7 


129318 


N93155 


Hs.285976 


calmodulin 1 (phosphorylase kinase; delta 


7.7 


129319 


AA037467 


Hs.30340 


ESTs 


6 


129351 


AA167268 


Hs.62349 


Human ras Inhibitor mRNA; 3' end 


9.3 


129366 


H18027 


Hs.184697 


plexin C1 


182 


129383 


W92984 


Hs.288224 


ESTs 


5.9 


129388 


AA151621 


Hs.1 10964 


ESTs 


4.1 


129391 


T80814 


Hs.11101 


discs; large (Drosophila) homoJog 3 (neur 


105 


129404 


AA172056 


Hs.317584 


ESTs 


5.3 


129406 


N23707 


Hs.1 11 138 


KIAA0712geneprodud 


4 


129426 


AA412087 


Hs.111323 


EST; Highly similar to protein Inhibitor o 


- 8 


129453 


AA421213 


Hs.111632 


Lsm3 protein 


5.5 


129513 


C00225 


Hs.306163 


ESTs; Weakly similar to fos39554_1 [H.s 


5.5 


129519 


AA298786 


Hs.1 12242 


ESTs 


6.8 


129606 


R21443 


Hs.166254 


heat shock 90kD protein 1; alpha 


5 


129622 


AA278243 


Hs.323949 


ESTs 


6.8 


129626 


AA447410 


Hs.111334 


ESTs; Weakly similar to HI! ALU SUBFA 


5.1 


129627 


AA258308 


Hs.71968 


Homo sapiens mRNA; cDNA DKFZp564 


5.3 


129628 


U26727 


Hs.1 174 


cycOn-dependent kinase inhibitor 2A (mel 


82 


129642 


R50008 


Hs.1 1806 


7-dehydrocholesterol reductase 


4.3 


129663 


AA442768 


Hs.1 1866 


translocase of inner mitochondrial membr 


4.4 


129665 


M86458 


Hs.1 18778 


KDEL (Lys-Asp-Glu-Leu) endoplasmic re 


4 


129691 


X06700 


Hs.1 19571 


collagen; type 111; alpha 1 (Ehlers-Daruos 


6 


129783 


AA454618 


Hs.12479 


associated molecule with the SH3 domain 


6.4 


129800 


AA252436 


Hs.12540 


r/sophospholipase 1 


7.7 


129836 


AA452161 


Hs.206521 


YME1(S.cerevisIaeH3ke1 


5 


129850 


N20593 


Hs.288932 


GDP dissodation inhibitor 2 


6.9 


129869 


AA102520 


Hs.13015 


ESTs; WeaWy similar to heat shock prote 


5 


129896 


AA043021 


Hs.13225 


UDP-Gal:betaGlcNAc beta 1;4- galactosy 


6.6 


129982 


M87789 




Immunoglobulin gamma 3 (Gm marker) 


4 


129985 


AA450045 


Hs.140452 


cargo selection protein (mannose 6 phosp 


5.8 
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130029 AA236412 
130033 M90696 
130036 AA195260 
130069 AA055896 
130077 T24055 
1300&0 X14850 
130096 AA223874 
130114 AA234717 
130125 M36803 
130135 M61764 
130170 M610070 
130189 043947 
130208 AA620556 
130211 D5084Q 
130235 X14046 
130276 S75295 
130280 L13738 

130313 AA620323 

130314 086967 
130328 AA135673 
130356 X84373 
130367 Z38501 
130378 T47333 
130384 X66364 
130393 013630 
130399 AA449417 
130407 N29888 
130414 M21121 
130417 U58522 
130421 D21260 
130441 U35835 
130455 X17059 

130498 L38951 

130499 AA416723 
130511 L32137 
130553 AA430032 
130558 H96654 
130568 AA232535 
130583 W24957 
130585 H66211 
130604 X03635 
130614 AA132007 
130619 AA477739 
130622 AA235247 
130625 F03969 
130627 L23808 
130629 M60346 
130635 M87503 
130639 059711 
130677 H17861 
130681 082808 
130693 AA487202 
130703 N63295 
130706 AA488843 
130712 AA292Q66 

130714 X92896 

130715 T98227 
130744 AA203527 
130747 AA471293 
130751 AA435633 
130796 R39390 
130800 AA223386 
130855 AA425439 
130859 AA287327 
130866 M58028 
130880 D14678 
130891 D31891 



Hs.236510 

Hs.181301 

Hs.125849 

Hs.146428 

Hs.91379 

Hs.147097 

Hs.197955 

Hs.14992 

Hs.1504 

Hs.21635 

Hs.151469 

Hs.151761 

Hs.15250 

Hs.23703 

Hs.153053 

Hs.169149 

Hs.153937 

Hs.154320 

Hs.154332 

Hs.154668 

Hs.155017 

Hs.8768 

Hs.155188 

Hs.166071 

Hs.155291 

Hs.155356 

Hs.155410 

Hs.241392 

Hs.155485 

Hs.178710 

Hs.155637 

Hs.155956 

Hs.180446 

Hs.158286 

Hs.1584 

Hs.252587 

Hs.15984 

Hs.16085 

Hs.293907 

Hs.16331 

Hs.1657 

Hs.16697 

Hs.12532 

Hs.16846 

Hs.260720 

Hs.1695 

Hs.1697 

Hs.1706 

Hs.17132 

Hs.17767 

Hs.17820 

Hs.17962 

Hs.18103 

Hs.201673 

Hs.279762 

Hs.18212 

Hs.171952 

Hs.18747 

Hs.6879 

Hs.18879 

Hs.19525 

Hs.19574 

Hs.143323 

Hs.20478 

Hs.2055 

Hs.20830 

Hs.20991 



ESTs; Moderately similar to PFT27 (M.m 
calhepsin S 

ESTs; Moderately similar to !!!! ALU SU 

collagen; type V; alpha 1 

ribosomal protein L26 

H2A hlstone family; member X 

KIAA0704 protein 

ESTs 

hemopexln 

tubulin; gamma 1 

caldunvcalmodulin-dependent serine pro 

KIAA0100 gene product 

peroxisomal D3;D2-enoyl-CoA isomerase 



CD37 antigen 
karyopherin alpha 1 (importin alpha 5) 
activated p21cdc42Hs kinase 
ubiquitin-activating enzyme E1C (homoto 
KIAA0212 gene product 
KIAA0391 gene product 
nuclear receptor interacting protein 1 
ESTs; Weakly similar to fill ALU SUSFA 
TATA box binding protein (TBP)-associa 
cyclin-dependent kinase 5 
WAA0005 gene product 
Homo sapiens mRNA for putative glucosy 
ESTs 

small inducible cytokine A5 (RANTES) 
huntingtin-interacting protein 2 
dathrin; heavy polypeptide^ 2 
protein kinase; DNA-activated; catalytic p 
N-acetyltransfe/ase 1 (arylamine N-acetyl 
karyopherin (importin) beta 1 
Homo sapiens mRNA for K1AA0446 prot 
cartilage cBgomenc matrix protein (pseud 
pituitary tumor-transforming 1 
ESTs; Weakly similar to gene pp21 protei 
ESTs; Highly similar to CGI-13 protein [H 
ESTs; Moderately similar to similar to C.e 
ESTs 

estrogen receptor 1 

ESTs 

ESTs 

ESTs; Weakly similar to cytochrome P45 
matrix metailoproteinase 2 (gelatinase A; 
matrix metailoproteinase 12 (macrophage 
ATPase; H+ transporting; lysosomal (vacu 
interferan-stimutatsd transcription factor 
ESTs 
ESTs 

Rho-associated; coiled-coi containing pro 

ESTs 

ESTs 

comichon-llke 
adenylate cyclase 7 

ONA segment on chromosome X (unique) 
occtudin 

POP7 (processing of precursor, S. cerevis 
ESTs 

Homo sapiens clone 23965 mRNA sequen 
ESTs 

ESTs; Weakly similar to katanin p80 subu 
putative DNA/chromatin binding motif 
ceroid-lipofuscinosis; neuronal 2; late infa 
ubiquitirwctivating enzyme E1 (A1S9T a 
kinesin-like 2 
SET domain; bifurcated; 1 

231 



5.6 
5.4 
7.4 
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12.1 
5 

7.8 
7,2 
5.6 
7.5 
6.4 
6.4 
4.5 
9.1 
8.6 
5 

6.1 

10 

6.1 

10.6 

8.3 

7.1 

5.6 

4.1 

4.6 

7 

4.1 
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130905 AA056489 Hs.129398 ESTs 8.7 

130913 W03592 Hs.21198 translocase of outer mitochondrial membr 205 

130919 AA291710 Hs.21276 collagen; type IV; alpha 3 (Goodpasture a 9 

130921 AA074596 Hs.194688 brornodomain adjacent to zinc finger dom 5.3 

5 130944 M97935 H&21486 signal transducer and activator of transcrip 18.8 

130974 X57985 Hs.2178 H2B hlstone family; member Q 13.4 

130987 R45698 Hs.21893 ESTs; Weakly similar to cAMP inducible 8.5 

130999 N489S3 Hs.21992 K1AA0689 protein 7.2 

131010 AA435748 Hs.169341 ESTs; Weakly similar to phosphatide acid 5.2 

10 131046 X02530 Hs.2248 small Inducible cytokine subfamily B (Cy 10.1 

131091 T35341 Hs.22880 ESTs; Highly similar to dipepGdyl peptid 6.3 

131153 H11760 Hs.23606 ESTs 7.3 

131185 M25753 Hs.23960 cycfinBI 6.2 

131200 AA609427 Hs.293732 ESTs; Moderately similar to I!!! ALU SU 4.3 

15 131206 AA044078 Hs.24210 ESTs 5.5 

131210 AA430047 Hs.95549 ESTs 7.1 

131227 AA429472 Hs.236522 OKF2P434P1 06 protein 5.6 

131244 038076 Hs,24763 RAN binding protein 1 5.5 

131245 AA620599 Hs.24766 DKFZP564E1962 protein 6.7 
20 131257 AA256042 Hs.24908 ESTs 5.8 

131319 U25997 Hs.25590 stanntocaicin 8.9 

131339 AA463450 Hs.25812 N5megen breakage syndrome 1 (nlbrin) 6.5 

131388 R34531 Hs.92200 KIAAQ460 gene product 9.2 

131410 H84658 Hs.279836 ESTs 12.1 

25 131472 AA608962 Hs.27258 cateycfin binding protein 18.1 

131475 Z39053 Hs.27263 ESTs 7.5 

131501 AA121127 Hs.8207 H3 histone; family 3A 5.5 

131514 X02152 Hs.2795 lactate dehydrogenase A 5.1 

131524 N39152 Hs.301804 ESTs 4.3 

30 131528 D60856 Hs.28309 UDP-gluoose dehydrogenase 8.4 

131544 N33236 Hs.28555 ESTs; Weakly similar to B051 1.8 [C.eleg 5.6 

131557 D30946 Hs.28707 signal sequence receptor; gamma (transloc 8.7 

131562 U90551 Hs.28777 H2A hlstone family; member L 18.8 

131564 AA491465 Hs.28792 ESTs 11.8 

35 131586 AA235385 Hs.26966 ESTs; Moderately similar to alternatively 4.7 

131587 M15182 Hs.183868 glucuronidase; beta 5.2 

131589 U52100 Hs.29191 epithelial membrane protein 2 4.4 

131615 D14533 Hs.192803 xeroderma pigmentosum; complementatio 4.6 

131664 AA136126 Hs.30327 mitogen-acQvatecI protein kinase-activated 4.3 

40 131679 AA136660 Hs.30579 ESTs 9.4 

131684 U26174 Hs.3066 granzymeK (serine protease; granzyme 3 9.7 

131687 L11066 Hs.3069 heat shock 70kD protein 98 (mortalin-2) 6.2 

131689 AA599653 Hs.30696 transcription factor-like 5 (basic helix^oo 8.3 

131693 W60913 Hs.110796 ESTs; WeaWy similar to cONA EST yk45 9 

45 131710 AA233225 Hs.30985 MRS1 protein 5.2 

131716 D49738 Hs.31053 cytoskeleton-associated protein 1 6.6 

131742 D31352 Hs.31433 ESTs 11 

131762 H46831 Hs.107767 ESTs; Moderately similar to CaM-Kll inh 4.9 

131781 AA460450 Hs.31989 DKFZP586G 1722 protein 9.2 

50 131795 N32724 Hs.32317 SoxJike transcriptional factor 4.5 

131809 176517 Hs.3260 present 1 (Alzheimer disease 3) 5.4 

131814 AA437226 Hs.157 interleukin 10 receptor; alpha 4 

131838 AA091932 Hs.180628 dynamin-like protein 6.7 

131877 J04088 Hs.156346 topoisomerase(DNA) II alpha (170k0) 5 

55 131885 AA044095 Hs.3402 ESTs 11.1 

131891 AA1 58258 Hs.30376 heterogeneous nuclear protein similar to r 5.6 

131925 AA248470 Hs.183180 ESTs; WeaJdy similar to RING finger pro 4.5 

131930 AA205460 Hs.69476 ESTs 14.3 

131941 D62657 Hs.35086 ublquilin-spedHc protease 1 6.2 

60 131965 W90146 Hs.35962 ESTs 6.3 

131970 D86960 Hs.3610 KIAA0205 gene product 4.2 

131971 R70167 Hs.154938 ESTs 4.3 
131974 AA410424 Hs^68122 Homo sapiens mRNA; cONA DKFZp586 4.6 
131977 F09788 Hs.3622 proconagen-proDne; 2-oxogtutarate 4-diox 6.4 

65 131994 AA479515 Hs.279882 Human DNA sequence from done 703H1 12 

131997 D82399 Hs.136644 Homo sapiens clone 23714 mRNA sequen 10 

132017 W67251 Hs.267659 Homo sapiens vav 3 oncogene (VAV3)m 4.7 
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132021 T68246 Hs.306079 chaperonin containing TCP1;subunit 5 {e 

132065 D82226 Hs.211594 proteasome(prosome;rracropain)26Ssub 

132085 D44466 Hs.3887 proteasome (prosome; macropain) 26S sub 

132089 AA131971 Hs.39122 ESTs 

5 132109 AA599801 Hs.40098 ESTs 

132143 AA257056 HsJ972 KIAA0871 protein 

132149 T10822 Hs.324743 ESTs 

132153 N90141 Hs.41068 ESTs; Moderately similar to ELONGATI 

132160 AA281770 Hs.295923 seven in absentia (Drosophila) homolog 1 

10 132164 U84573 Hs.41270 procollagerviyslne; 2-oxogiutaiate 5-dioxy 

132180 AA405569 Hs.418 fibroblast activation protein; alpha; sepras 

132183 L19183 Hs.199695 hypothetical protein 

132225 AA128980 ESTs 

132227 AA412620 Hs.4248 ESTs 

15 132235 F09058 Hs.42656 ESTs 

132256 AA608856 Hs.431 murine leukemia viral (bmM) oncogene h 

132298 N41849 Hs.7120 Homo sapiens cytokine receptor related p 

132314 AA285290 Hs.44499 small EDRK-rich factor 2 

132325 N37065 Hs.44856 ESTs 

20 132384 AA479933 Hs.46967 Human DNA sequence from done 167A1 

132387 R70914 Hs.281434 heat shock 70kD protein 1 

132393 W85888 Hs.47334 ESTs; Moderately similar to III! ALU SU 

132406 F09979 Hs.4774 ESTs 

132407 AA431459 Hs.47783 ESTs 

25 132413 AA132969 Hs.260116 K1AA1 104 protein 

132446 AA426218 Hs.46764 ESTs 

132465 AA047896 Hs.49169 ESTs 

132482 AA429478 Hs.238126 ESTs; Highly similar to CGi-49 protein [H 

132492 T03749 Hs.4990 K1AA1089 protein 

30 132528 AA283006 Hs.50758 chromosome-associated polypeptide C 

132540 AA488987 Hs.5097 synaptogyrin 2 

132543 AA417152 Hs.5101 protein regulator of cytokinesis 1 

132580 L37042 Hs.283738 casein kinase 1; alpha 1 

132586 AA412452 Hs.52515 DKFZP434N024 protein 

35 132608 AA199588 Hs.5321 ARP3 (acrjn-related protein 3; yeast) horn 

132616 AA386264 Hs.283558 isocitrate dehydrogenase 2 (NADP+); mil 

132617 AA171913 Hs.5338 carbonic anhydrase XI) 

132618 AA253330 Hs.279916 adaptor-related protdn complex 1; gamma 
132640 U33821 Hs.5437 Taxi (human T-cell leukemia virus type I 

40 132668 AA453614 Hs.5460 KIAA0776 protein 

132694 M60830 Hs.5509 ecotropic viral integration site 2B 

132700 N47109 Hs.5521 ESTs 

132724 AA417962 Hs.55498 geranyigeranyl diphosphate synthase 1 

132738 W42674 Hs.264636 ESTs; Moderately similar to neuronal thre 

45 132742 AA490862 Hs.292812 ESTs; Weakly similar to C43H8.1 (C.eteg 

132744 X54326 Hs.55921 glutamyl-prolyl-tRNA synthetase 

132795 H99152 Hs.57079 ESTs 

132807 AA331777 Hs.57301 mutL (E. coii) homolog 1 (colon cancer; n 

132811 U25435 Hs.57419 transcriptional repressor 

50 132817 AB004884 Hs.57553 tousled-te kinase 2 

132840 N23817 Hs.5807 Homo sapiens clone 23675 mRNA sequen 

132845 D62588 Hs.5813 ESTs 

132847 T48195 Hs.58189 eukaryotic translafjon initiation factor 3; s 

132856 W79865 Hs.58367 glypican4 

55 132869 N26855 Hs.203961 ESTs 

132874 AA425776 Hs.58609 ESTs 

132880 AA444369 Hs.177537 ESTs 

132894 D82422 Hs.5944 ESTs 

132900 N56451 Hs.5978 UM domain only 7 

60 132903 AA235404 Hs.5985 Homo sapiens clone 25186 mRNA sequen 

132904 X83618 Hs.59889 3-hydroxy-3-mefoylglutaryl-CoenzymeA 

132906 AA142857 Hs.234896 ESTs; Highly similar to gemlnin [Ksapie 

132914 AA496037 Hs.60293 ESTs 

132918 AA252605 Hs.6051 K1AA061 6 protein 

65 132936 AB002305 Hs.6111 K1AA0307 gene product 

132951 U04209 Hs.61418 microfibrillar-associated protein 1 

132957 AA234791 Hs.61469 Human gene from PAC 753P9; chromoso 
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132959 
132968 
132984 
132990 
5 132994 
132998 
133002 
133005 
133015 

10 133016 
133039 
133050 
133056 
133062 

15 133083 
133091 
133093 
133124 
133126 

20 133196 
133214 
133225 
133228 
133239 

25 133240 
133257 
133264 
133274 
133281 

30 133283 
133287 
133294 
133297 
133318 

35 133362 
133370 
133391 
133395 
133422 

40 133431 
133435 
133449 
133468 
133484 

45 133506 
133517 
133551 
133569 
133572 

50 133577 
133589 
133608 
133617 
133627 

55 133633 
133634 
133640 
133644 
133649 

60 133652 
133674 
133700 
133705 
133716 

65 133720 
133752 
133765 



AA028103 Hs.61472 ESTs; Weakly similar to unknown (S.cere 

N77151 Hs.61638 myosin X 

H80409 Hs.62112 zinc finger protein 207 

AA458761 Hs.18387 transcription factor AP-2 alpha (activating 

AA505133 Hs.279905 solute carrier family 2 (facilitated glucose 

Y00062 Hs.1 701 21 protein tyrosine phosphatase; receptor typ 

AF006082 Hs.42915 ARP2 (actirwelated protein 2; yeast) horn 

C21400 Hs.278605 WAA0970 protein 

AA047036 Hs.246315 ESTs 

W81 298 Hs.6289 growth factor receptor-bound protein 2 

X62055 Hs,63489 protein tyrosine phosphatase; non-recepto 

S67325 Hs.63788 propionyl Coenzyme A carboxylase; beta 

AA071387 Hs.6396 jumping translocation breakpoint 

R33663 Hs.64056 ESTs 

N7Q633 Hs.6456 chaperonin containing TCP1; subunit 2 (b 

AA122147 Hs.64691 K1AA0463 protein 

AA598749 Hs.285996 ESTs 

AA156049 Hs.267923 ESTs 

D16469 Hs.6551 ATPase; H+ transporting; lysosomal (vacu 

R37367 Hs.6727 Ras-GTPase activating protein SH3 doma 

Y10659 Hs.285115 Interleukin 13 receptor, alpha 1 

241415 Hs.6823 ESTs; Weakly similar to intrinsic factor-B 

N90029 Hs.6831 Homo sapiens done 1400 unknown prate 

AA059405 Hs.179882 Homo sapiens clone 24655 mRNA sequen 

D31161 Hs.242894 ESTs 

AF006086 Hs.6895 actin related protein 2/3 complex; subunit 

W72187 Hs.69192 ESTs; Weakly similar to cONA EST yk37 

AA488886 Hs.6949 ESTs 

AA421079 Hs.69594 ESTs; Weakly similar to Sox-like transcri 

AA410507 Hs.6968 ESTs 

L15702 Hs.69771 B-factor; properdin 

R79723 Hs.69997 zinc finger protein 238 

AA6G0057 Hs.70266 KIAA0905 protein 

AA256168 Hs.152316 ESTs 

H06195 Hs.7194 ESTs; Highly similar to CGI-59 protein (H 

AA156897 Hs.72157 OKFZP56411922 protein 

X57579 Hs.727 Inhib1n;betaA(acllvlnA;activinABa!p 

AA491296 Hs.72805 ESTs 

N7951 6 Hs.73287 ESTs; Weakly similar to eyelid [D.melano 

AA255438 Hs.7358 Homo sapiens mRNA; cONA DKF2p566 

T23983 Hs.323966 ESTs 

AA094989 Hs.7381 voltage-dependent anion channel 3 

X03068 Hs73931 major histocompatibility complex; class II 

X78710 Hs.21 1581 metakegulatory transcription factor 1 

AA316868 Hs.74346 ESTs; Weakly similar to 140G11.h (Djne 

X52947 Hs.74471 gap junction protein; alpha 1 ; 43kO (conn 

D63480 Hs.278634 KIAA01 46 protein 

AA313977 Hs.172772 transcription elongation factor 6 (Sill); po 

W94333 Hs.27991 5 transtocase of inner mitochondrial membr 

F03717 Hs.75063 human immunodeftciency virus type I enh 

L37368 Hs.75104 RNA-blnding protein SI; serine-rich dom 

013315 Hs.75207 glyoxalasel 

M148318 Hs.75249 KIAA0069 protein 

009587 Hs75280 grycyMRNA synthetase 

D21262 Hs.75337 nucleolar phosphoprotein p1 30 

U24166 Hs.234279 mlcrotubule-associated protein; RP/EB fa 

D83004 Hs.75355 ubiquitin-conjugating enzyme E2N (homo 

D89077 Hs.75367 Src-Cke-adapter 

AA479139 Hs.75393 add phosphatase 1; soluble 

AA287383 Hs.7540 ESTs 

AA458946 Hs.75497 ESTs 

K01396 Hs.297681 protease inhibitor 1 (antkilastase); alpha- 

N21648 Hs75659 MpV17 transgene; murine homolog; glom 

Y00282 Hs.75722 ribophorfntl 

L27841 Hs.75737 pericentriolar material 1 

U4927B Hs.75875 ubiquitin-conjugaflng enzyme E2 variant 

D21255 Hs.75929 cadherin 11 (OB-cadherin; osteoblast) 
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133772 W73693 Hs.76038 IsopentenyWiphosphate delta isomerase 

133774 Z23090 Hs.76067 heat shock 27kD protein 1 

133776 J03473 Hs.177766 ADP-ribosyltransferase(NAD-Kpo!y(AO 

133784 AA214305 Hs.301064 ESTs 

5 133814 M33882 HsJ6391 myxovirus (influenza) resistance 1; homo! 

133829 AA453783 Hs.76550 Homo sapiens mRNA; cONA DKFZp564 

133834 AA147510 Hs,288660 serine protease; umbilical endothelium 

133839 M59815 Hs.170250 complement component 4A 

133842 U73477 Hs.285013 putative human HLA dass II associated p 

10 133845 T68510 Hs.76704 ESTs 

133859 U86782 Hs.178761 26S proteasome-associated pad1 homolog 

133867 D43948 Hs.76989 WAA0097 gene product 

133868 U58090 Hs.183874 culiin 4A 
133871 AA454597 Hs.182793 ESTs 

15 133893 X01060 Hs.77356 transferrin receptor (p90;CD71) 

133914 N32811 Hs.77542 ESTs 

133918 W72783 Hs.58382 ESTs; Weakly similar to C13F10.5 [Cele 

133944 AA045870 Hs.7780 Homo sapiens mRNA; cONA DKFZp564 

133946 AA156565 Hs.1 73878 4-nitrophenylphosphatase domain and non 

20 133963 134587 Hs.184693 transcription elongation factor B (SHI); po 

133980 D00760 Hs.250811 proteasome (prosome; macropaln) subunit 

133990 C02374 Hs.7822 Homo sapiens mRNA; cONA DKF2p564 

133999 M28213 Hs.78305 RAB2; member RAS oncogene family 

134030 J03077 Hs.78575 prosaposin (variant Gaucher disease and v 

25 134032 Z81326 Hs.78589 protease inhibitor 12 (neuroserpin) 

134045 S82470 Hs.78768 BB1 

134046 028473 Hs.172801 isoleuctne-lRNA synthetase 
134064 D87685 Hs.78893 K1AA0244 protein 
134070 H98621 Hs.78946 culiin 3 

30 134087 U51166 Hs.173824 thymine^ glycosylate 

134090 M22382 Hs.79037 heat shock 60kD protein 1 (chaperonin) 

134098 X06323 Hs.79086 ribosomal protein; mitochondrial; L3 

134110 U41060 Hs.79136 UV-1 protein; estrogen regulated 

134132 U32519 Hs.220689 Ras-GTPase-activaling protein SH3«doma 

35 134168 AA398908 Hs.1 81634 Human Chromosome 16 BAC done CiT9 

134170 M63138 Hs.79572 cathepsin D (lysosomal aspauiyi protease) 

134208 U88871 Hs.79993 peroxisomal biogenesis fador 7 

134258 L28010 Hs.808 heterogeneous nudear ribonucleoprotein F 

134288 AA430008 Hs.8117 ESTs 

40 134310 AA313414 Hs.8148 Homo sapiens done 24856 mRNA sequen 

134326 U16306 Hs.81800 chondroitin sulfate proteoglycan 2 (versic 

134329 D38551 Hs.81848 * RAD21 (S. pombe) homolog 

134331 AA452020 Hs.1 11222 ESTs; Weakly similar to CGM28 protein 

134351 R82074 Hs.82109 syndecan 1 

45 134357 L43575 Hs.82171 Human done 191 B7 placenta expressed m 

134363 M37033 Hs.82212 CD53 antigen 

134367 X54199 Hs.82285 phosphoribosylgtycinamideformylfransfe 

134374 D62633 Hs.8236 ESTs 

134375 AA412720 Hs.82389 ESTs; Highly similar to CGI-1 1 8 protein 
50 134376 X02874 Hs,82395 2*;5^oligoadenylate synthetase 1 

134381 U56637 Hs.184270 capping protein (actin filament) musde Z- 

134388 -M15841 Hs.82575 small nudear ribonudeoprotein polypepti 

134395 L09717 Hs.8262 lysosomal-assoclated membrane protein 2 

134399 H99801 Hs.82689 tumor rejection antigen (gp96) 1 

55 134401 AA243746 Hs.211577 kinectin 1 (kinesin receptor) 

134405 J04177 Hs.82772 collagen; type XI; alpha 1 

134415 AA329274 Hs.82911 protein tyrosine phosphatase type IVA; m 

134417 D87969 Hs.82921 solute carrier family 35 (CMP-sialic acid t 

134419 108044 Hs.82961 trefoil factor 3 (intestinal) 

60 134421 AA122386 Hs.82985 collagen; type V; alpha 2 

134423 W96151 Hs,83006 ESTs; Highly similar to CGt-139 protein 

134438 AA449984 Hs.246857 ESTs; Highly similar to proteine kinase JN 

134446 T25732 Hs.83419 KIAA0252 protein 

134453 X70683 Hs.83484 SRY (sex determining region Y)-box 4 

65 134470 X54942 Hs.83758 CfX28 protein kinase 2 

134487 R38185 Hs.83954 Homo sapiens unknown mRNA 

134495 D63477 Hs.84087 K1AA0143 protein 
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Hs.84131 

Hs.84285 

Hs.848 

Hs.172280 

Hs.66041 

Hs.86347 



Hs.8739 



Hs.284226 
Hs.89603 
Hs.89718 
Hs.89751 



Hs.90061 

Hs.90280 

Hs.904 

Hs.90419 

Hs.9071 

Hs.92308 



Hs.278338 

Hs.173685 
Hs.278589 



134498 M63180 
134506 U45328 
134529 H24460 
134570 U66615 
5 134582 AA234966 
134600 R68884 
134623 X74496 

134654 W23625 

134655 AA454070 Hs. 123090 
10 134675 AA250745 Hs.87773 

134711 X04011 Hs.68974 
134714 U89922 Hs.890 
134722 W47183 
134776 J05582 

15 134806 Z49099 
134810 M27394 
134840 U51477 
134843 H60595 
134853 D82348 

20 134866 U84011 
134868 Z39762 
134885 N27670 
134982 N46086 
134989 AA236324 Hs.92331 

25 134992 H05625 Hs.5831 
134993 AA282343 Hs.301005 
135010 D59675 Hs.92927 
135015 U54999 
135029 AA224180 

30 135032 AA243497 
135037 U77948 
135059 AA598449 Hs.93832 
135071 LO8069 Hs.94 
135083 M495950 Hs.94262 

35 135117 W52493 Hs.94694 
135144 AA044842 Hs.95260 
135154 AA126433 Hs.267812 
135218 D31157 Hs.324277 
135237 AA454930 Hs.9691 

40 135243 AA215333 Hs.97101 
135335 H20989 Hs.198281 
135349 D83174 Hs.9930 
i35367 AAW1Q3 Hs.9963 
135389 U05237 Hs.99872 

45 135400 M23263 
135411 L10333 
300019 M97935 

300021 M97935 

300022 M97935 
50 300089 AI199738 

3X107 A1694585 
300254 AW079607 Hs.188417 
300328 AW015860 Hs.224623 
300549 AA699328 Hs.298119 

55 300711 AI492179 Hs,166244 
300921 AW293224 Hs.232165 
301124 T79326 Hs.298262 
301165 N85789 Hs.150186 
301576 AI682905 Hs.270431 

60 301604 AA373124 Hs.24809 
301704 AA526313 Hs.293691 
301782 N99399 Hs.143046 
301884 AA312082 Hs. 105445 
301936 NMJ)04694Hs.1 14924 

65 302002 AF013956 Hs.5637 
302032 NMJ01992HS.128087 
302067 H05698 Hs.222399 



Hs.99915 
Hs.99947 



Hs.208275 
Hs^70464 



threonyMRNA synthetase 
ubiquitin-conjugating enzyme E21 (homol 


6.1 
4.6 


FK506-bindfog protein 4 (59kD) 


6.2 


SW1/SNF related; matrix associated; acfin 


4.8 


CGG triplet repeat binding protein 1 


4.7 


ESTs; Weakly similar to predicted using G 
pror/lenoopeptWase 


5.8 
4.5 


ESTs; Weakly similar to ORF YGR200c [ 


13.7 


ESTs 


5.8 


protein kinase; cAMP -dependent; catatyti 


8.9 


cytochrome b-245; beta polypeptide (chro 


6.8 


lymphotoxin beta (TNF superfamily; mem 


35.7 


ESTs; Weakly similar to neural F box pro 


8.1 


mucin 1; transmembrane 


6.2 


spermine synthase 


4.2 


membrane-spanning 4-domains; subfamily 


7 


diacylglycerol kinase; zeta (104kD) 


4.1 


progesterone binding protein 


4.7 


5-aminolm]dazole-4-carboxamlde ribonuc 


10.2 


amyio-1;6-giucosidase; 4-aipha-glucanotr 


12.1 


K1AA0882 protein 


6 


progesterone membrane binding protein 


5 


ESTs 


4.1 


Homo sapiens mRNA; chromosome 1 spe 


16.8 


ESTs 


4 


purtne-nch element binding protein 8 


4.4 


ESTs 


7 


LGN protein 


4.8 


ESTs; Moderately similar to 17-beta-hydr 


13.6 


Human DNA sequence from clone 30M3 


4 


general transcription factor It; i 


8 


Homo sapiens clone 24483 unknown mRN 


5.4 


heat shock protein; DNAJ-like 2 


9.3 


ESTs 


6.7 


Homo sapiens done 24837 mRNA sequen 


10.2 


Homo sapiens mRNA; cDNA OKFZp586 


6.6 


sorting nextn 4 


7.4 


ESTs; Weakly similar to growth factor-res 


6.2 


ESTs 


19.5 


putative G protein-coupled receptor 


8.8 


pyruvate kinase; muscle 


12.4 


collagen-binding protein 2 (colligen 2) 


5.5 


TYRO protein t/msine kinase binding pro 


5.4 


fetal Alzheimer antigen 


7.8 


androgen receptor (dihydrotestosterone re 


9.1 


reticutan 1 


5.3 


AFFX control: STAT1 


- 8.3 


AFFX control: STAT1 


7 


AFFX control: STAT1 


14 


ESTs; Weakly similar to HIl ALU CLASS 


9.1 


ESTs; Weakly similar to !!ll ALU CLASS 


7.4 


ESTs; WeaWy similar to ZnT-3 [H.sapien 


30.1 


ESTs 


11.9 


ESTs 


5.5 


ESTs; Weakly simPar to cDNA EST yk40 


11 


ESTs 


11 


ESTs; Weakly similar to (U88J8.1 [H.sapi 


8.8 


ESTs; Weakly similar to PTERIN^ALP 


6 


ESTs; Weakly similar to UII ALU SUBFA 


. 4.7 


ESTs;Wea)oysimflartoC17Giai [C.ele 


8 


ESTs 


4.2 


EST duster (not in UniGene) with exon h 


18 


GDNF family receptor alpha 1 


20.7 


EST duster (not in UniGene) with exon h 


11.6 


chromobox homolog 4 (DrosophUa Pc da 


9.2 


EST cluster (not in UniGene) wim exon h 


4.3 


ESTs; Weakly similar to proteirv-tyroslne 


7.8 
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302145 NM 00361 3HS.151407 
302236 AI128606 Hs.6557 
302276 NMJ>04448Hs.323910 
302290 AL117607 Hs.175563 
302326 NMJ»4271Hs.184018 
302342 AB023141 Hs.190386 
302372 AL1 17406 Hs.200102 
302422 AB021227 Hs.3743 
302431 AF129530 Hs.226434 
302501 AF022726 Hs.251446 
302505 AL049650 Hs.247874 
302533 L36149 Hs.248116 
302638 AA463798 Hs.102696 
302656 AW293005 Hs.70704 
302792 AA343696 Hs.46821 
302820 X04588 Hs.85844 
302838 U66049 Hs.82171 
302892 N5B545 Hs.42346 
302977 AW263124 Hs.315111 
302989 N46406 Hs.84700 
303007 AM78876 Hs.317714 
303052 AF140242 Hs.279926 

303131 AW081061 Hs.103180 

303132 AI929819 Hs.4055 
303153 U09759 Hs.246857 
303387 AA908797 Hs.180799 
303499 AI815990 Hs.293515 
303502 AA488528 

303576 T07216 Hs.301226 
303620 AA397546 Hs.119151 
303634 AI953377 Hs.28444 
303642 AW299459 Hs.111977 
303654 AA436942 Hs.288529 
303733 AW502498 Hs.15220 
303780 AM24014 Hs.18995 
303792 C75094 Hs.199839 
303842 A1337304 Hs.126268 
303951 AW475081 Hs.172928 
304465 AA421948 
304507 AA456426 
304591 AA505702 
304601 AA507875 
304659 AA533185 
305040 AA630582 Hs.169476 
305134 AA653159 Hs.179661 
305415 AA725116 Hs.78465 
305453 AA738110 
305898 AA872838 
305913 AA876109 
305950 AA884479 
306004 AA889992 Hs.2186 
306009 AA894560 Hs.283370 
306060 AA906161 Hs.76277 
306398 AA970548 Hs.297681 
306505 AA987722 Hs.172928 
306576 AA995761 Hs.276092 
307117 AJ184111 Hs.76067 
307138 A1185516 Hs.172928 
307187 A1190870 Hs.276417 
307542 A1280859 Hs.62954 
307554 A1281603 Hs.172928 
307806 AI351739 Hs.276726 
308079 AI472733 Hs.270208 
308307 AI581398 Hs.172928 
308511 AJ687580 Hs.169476 
308615 A1738593 Hs.101774 
308677 AJ761173 



EST duster (not in UniGene) with exon h 
zinc Anger protein 161 
EST duster (not in UniGene) with exon h 
Homo sapiens mRNA; cDNA OKFZp564 
EST cluster (not in UniGene) with exon h 
K1AA0924 protein 

Homo sapiens mRNA; cDNA OKFZp434 
matrix metalioproteinase 24 (membrane-in 
EST cluster (not in UniGene) with exon h 
EST duster (not in UniGene) with exon h 
multiple UniGene matches 
chemokine (C motif) XC receptor 1 
ESTs; Weakly similar to C11D2.4 [C.eieg 
ESTs 

ESTs; Weakly similar to putative fltsapie 

EST duster (not in UniGene) with exon h 

EST duster (not in UniGene) with exon h 

histone deacetylase 3 

EST duster (not in UniGene) with exon h 

EST cluster (not In UniGene) with exon h 

pallid (mouse) homolog; pallidin 

EST duster (not in UniGene) with exon h 

acMke6 

ESTs 

mltogen-activated protein kinase 9 

ESTs 

ESTs 

EST duster (not in UniGene) with exon h 
EST duster (not in UniGene) with exon h 
ESTs 

ESTs; Weakly similar to predicted using G 
EST duster (not in UniGene) with exon h 
ESTs 

ESTs; Weakly similar to zinc finger prote 

ESTs; Moderately similar to K1AA0456 p 

ESTs; Highly simflar to NG22 IH^apiens 

ESTs; Weakly similar to similar to PDZ d 

collagen; type I; alpha 1 

EST singleton (not in UniGene) with exon 

EST 

EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
glyceraldehyde-3-phosphate dehydrogena 
EST singleton (not in UniGene) with exon 
EST singleton (not In UniGene) with exon 
EST singleton (not in UniGene) with exon 
keratln8 

EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not In UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not In UniGene) with exon 
heat shock 27kD protein 1 
collagen; type I; alpha 1 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
ESTs 

collagen; type I; alpha 1 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
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308852 A1829848 Hs.1 82937 


pepGdylprotyt isomerase A (cyclophitin A 


5.9 


308974 AI872290 Hs.300697 


immunoglobulin gamma 3 (Gm marker) 


4.5 


308981 AI873242 


EST singleton (not In UniGene) wilh exon 


7.6 


308995 AI880172 


EST singleton {not In UniGene) with exon 


6.6 


309177 AI951118 


EST singleton (not in UniGene) with exon 


24.3 


309186 AI952723 Hs.90207 


EST singleton (not In UniGene) with exon 


6.1 


309198 AI955915 


major histocompatibility complex; class 1; 


5.6 


309226 AI969897 


EST singleton (not in UniGene) with exon 


6.2 


309279 AI990102 


EST singleton (not in UniGene) with exon 


7.9 


309583 AW170035 


EST 


645 


309624 AW191929 Hs.252989 


EST 


5.3 


309629 AW1S2764 H$.i72928 


coflagen; type 1; a/pha i 


6.9 


309641 AW194230 Hs.253100 


EST 


11.4 


309698 AW236461 Hs.73742 


ribosomal protein; large; P0 


4.3 


309700 AW241170 Hs.1 79661 


Homo sapiens clone 24703 beta-tubulin m 


11.9 


31007^ AI335004 Hs.148558 


ESTs 


42 


310094 AW450967 Hs.235240 


ESTs 


5.7 


310373 AW080778 Hs.145582 


ESTs 


4.6 


310438 AW022192 Hs.200197 


ESTs 


39.1 


310470 AI281848 Hs.194691 


ESTs 


4.9 


310583 AW205632 Hs.211198 


ESTs 


7 


310877 T47784 Hs.188955 


ESTs 


4.1 


311067 AI587332 Hs.209115 


ESTs 


11.2 


311166 AI821294 Hs.1 18599 


ESTs 


24.1 


311199 T57896 Hs.1 91 095 


EST cluster (not in UniGene) 


5.7 


311465 AI758660 Hs.206132 


ESTs 


15.7 


311587 AI828254 Hs.271019 


ESTs 


6.4 


311774 AA700870 Hs.14304 


ESTs 


6.2 


311785 AIQ56769 Hs.133512 


ESTs 


5 


311923 T60843 .Hs.189679 


ESTs 


5.9 


311935 AA216387 


EST cluster (not in UniGene) 


5.5 


311972 N51511 Hs.1 88449 


ESTs 


5.2 


312014 AI435650 Hs.128778 


ESTs 


4.3 


312047 AA588275 Hs.1 80669 


ESTs 


14.7 


312147 T89855 Hs.195648 


EST cluster (not in UniGene) 


9.8 


312153 AA759250 Hs.153028 


cytochrome b-551 


27.1 


312168 T92251 Hs.198882 


ESTs 


4.2 


312172 AI222168 Hs.191168 


ESTs 


6.1 


312226 AI796815 Hs.199993 


ESTs; Weakly similar to ubiquitous TPR 


5.5 


312292 AW451893 Hs,151124 


ESTs 


18.4 


312312 AI080505 Hs.134529 


ESTs 


11.9 


312369 AA582039 Hs.1 73884 


Homo sapiens mRNA; chromosome 1 spe 


4 


312407 R46180 Hs.153485 


ESTs 


13.6 


312430 AW139117 Hs.117494 


ESTs 


4.1 


312470 AW451347 Hs.175862 


ESTs 


4.6 


312483 A1417526 Hs.7753 


ESTs 


15.3 


312521. AA033609 Hs.319093 


ESTs 


125 


312544 A1498371 Hs.183526 


ESTs 


14.6 


312638 AW439195 Hs.256880 


ESTs 


5.3 


312754 R99834 Hs.250383 


ESTs 


8.4 


312772 H63791 


EST cluster (not in UniGene) 


4.3 


312821 AA699325 Hs.269880 


ESTs 


8.3 


312837 AW292286 Hs.255058 


ESTs 


7.1 


312849 AA846353 Hs.194054 


ESTs 


5.9 


312854 AA828713 Hs.321058 


EST cluster (not tn UniGene) 


4.1 


312992 AA088446 Hs.170298 


ESTs 


7.3 


313096 AI422367 Hs.163533 


ESTs 


6.1 


313112 AA732534 Hs.269099 


ESTs 


42 


313126 AA720887 Hs.283313 


EST cluster (not In UniGene) 


18.1 


313136 N59284 Hs.288010 


ESTs 


17 


313197 AI738851 Hs.222487 


ESTs 


12.9 


313219 N74924 Hs.182099 


ESTs 


7.1 


313258 AW068358 Hs.183918 


ESTs 


13.7 


313328 AW449211 Hs.105445 


ESTs 


275 


313352 AW292127 Hs.144758 


ESTs 


9.8 


313417 AA741151 Hs.137323 


ESTs 


8.2 


313455 AW081702 Hs.98571 


ESTs 


6.9 
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313590 
313663 
313667 
313749 
313832 
313881 
313915 
313955 
313974 
314097 
314129 
314359 
314384 
314394 
314462 
314465 
314470 
314488 
314506 
314510 
314558 
314661 
314691 
314754 
314775 
314943 
314961 
314963 
315006 
315010 
315019 
315033 
315036 
315037 
315051 
315054 
315073 
315080 
315083 
315088 
315175 
315196 
315296 
315303 
315352 
315364 
315368 
315390 
315408 
315458 
315472 
315478 
315498 
315527 
315530 
315562 
315634 
315647 
315652 
315676 
315680 
315735 
315741 
315769 
315978 
315984 
316042 



AA804410 Hs.291677 
A1953261 Hs.169813 
U69201 Hs.13684 
AW450376 Hs.1 19004 
AW271022 Hs.133294 
AA535580 Hs.16331 
AI969390 Hs.163443 
AJ858884 Hs.270647 
AI310151 Hs.173524 
AA648744 Hs.269493 
AA228366 Hs.1 15122 
AA205569 Hs.194193 
AA535840 Hs.162203 
AI380563 Hs.130816 
AA347951 Hs.326413 
AA602917 Hs.156974 
AI934422 Hs.30661 
AA358265 HS.1B2890 
AA833655 Hs.20686B 
AI204418 Hs.190080 
A1873274 Hs.190721 
AA436432 Hs.324239 
AW207206 Hs.136319 
AW026761 Hs.134374 
AI149880 Hs.188809 
AI476797 Hs.184572 
AW008061 Hs.231994 
AI689617 Hs.200934 
AI538613 Hs.298241 
AA531082 Hs.240049 
AA532807 Hs.105822 
A1493046 Hs.146133 
AA534953 Hs.163297 
AW205863 Hs.133988 
AW292425 Hs.163464 
A1968598 HsJ8768 
AW452948 Hs.257631 
AA744550 Hs.136345 
AI221325 HS.205442 
AA557351 Hs.152448 
AI025842 Hs.152530 
AA972756 Hs.44898 
AA876905 Hs.125286 
AW194364 Hs.128022 
AA604799 Hs.136528 
AA643602 Hs.155485 
AW291563 Hs.104696 
AI801565 Hs.200113 
AW273261 Hs.216292 
AA872000 Hs.116104 
AA828850 Hs.165469 
AA665612 Hs.120874 
AA628539 Hs.116252 
A1791138 Hs.116768 
AI2Q0852 Hs.12r780 
AA737415 Hs.152826 
AA837085 Hs.220585 
M648983 Hs.212911 
AI521489 Hs.3053 
AW002565 Hs.124660 
AA814309 Hs.123583 
AI831760 Hs.155111 
AA812168 Hs.122559 
AA744875 Hs.189413 
AA830893 Hs.119769 
AI015862 Hs.131793 
AW297979 Hs. 170698 



EST duster (not in UniGene) 
ESTs 

ESTs; Weakly similar to choline kinase is 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to alternatively spli 

ESTs 

ESTs 

ESTs 

ESTs ■ 

ESTs 

ESTs 

ESTs 

ESTs 

EST cluster (not In UniGene) 

ESTs 

ESTs 

ESTs 

cell division cycle 2; G1 to S and G2 to M 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to gene MAC25 pr 

EST 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Moderately similar to MULTIFUN 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to F1G-1 PROTEIN 
ESTs; Moderately similar to III! ALU SU 
ESTs; Highly similar to serine protease (H 
ESTs 

ESTs; Weakly similar to alternatively spli 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Moderately similar to 111! ALU SU 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 
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5.3 

7.6 

12.5 

5.5 

4.3 

7.7 

27.1 

5.7 

4.3 

14.5 

9.5 

5.4 

5.3 

13.2 

6.2 

18,1 

4.2 

6.1 

27.8 

9.5 

22.5 

13.3 

21.4 

4.4 

4.4 

18.4 

10.2 

5.3 

20.7 

5 

6.1 

12 

8.3 

6.1 

12.7 

7.6 

13.9 

4.4 

5.1 

4.7 

11.9 

28.8 

16.1 

25.7 

12.3 

4.6 

4.8 

4.4 

5 

7.6 

4.9 

5.2 

4.8 

4.4 

22.4 

5.9 

8.8 

15 

6.3 

9.2 

8.1 

13.4 

5.4 

4.4 

10.4 

5 

14.7 
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316136 AA830808 


Hs.124366 


ESTs 


4 


316177 AI908272 


Hs.293102 


EST cluster (not in UniGene) 


326 


316313 AA741300 


Hs.202599 


ESTs 


4.8 


316405 AA757900 Hs.270823 


ESTs 


4.8 


316480 AI749921 


Hs.205377 


ESTs 


12.9 


316564 AI743571 


Hs.168799 


ESTs; WeakJy similar to II!! ALU SUBFA 


8.1 


316714 AA809792 Hs.123307 


ESTs 


5 


316715 A1440266 


Hs.170673 


ESTs 


4.2 


316828 AA828116 Hs.173076 


ESTs 


5.2 


316869 A1954880 


Hs.134604 


ESTs 


13.3 


316905 AW138241 Hs.210846 


ESTs 


6.2 


316943 AW014875 Hs.137C07 


ESTs 


5.3 


316949 AA856749 Hs.124620 


ESTs 


7.2 


317008 AW051597 Hs.143707 


ESTs 


4.1 


317028 AA962623 


Hs.189144 


ESTs; Weakly similar to RENAL SODIU 


4.2 


317067 AI805392 


Hs.325335 


ESTs 


4.5 


317069 AI732892 


Hs.190489 


ESTs 


6.4 


317210 AA490718 




EST cluster (not in UniGene) 


4.4 


317298 AI922374 


Hs.158549 


ESTs 


5.9 


317658 AW139077 Hs.202217 


ESTs 


4.6 


317674 AW294909 Hs.132208 


ESTs 


5.2 


317685 AI798630 


Hs.149997 


ESTs 


4.3 


317836 AA983913 


Hs.128929 


ESTs 


12.4 


317881 AI827248 


Hs.224398 


ESTs 


121 


317902 AI828602 


Hs.211265 


ESTs 


8.8 


317916 AI565071 


Hs.159983 


ESTs 


12.6 


318042 AW294522 Hs.149991 


ESTs 


5.6 


318053 AJ074465 


Hs.133469 


ESTs 


4 


318064 AW296888 Hs.170939 


ESTs 


5.2 


318070 AI024594 


Hs.248942 


ESTs 


4.7 


318073 AW167087 Hs.131562 


ESTs 


15.7 


318146 AI040125 


Hs.150521 


ESTs 


5.9 


318186 AW016773 Hs.3709 


ESTs 


5.3 


318481 AI291584 


Hs.145921 


ESTs; Weakly similar to HYPOTHETICA 


7.6 


318566 AI335361 


Hs.226376 


ESTs 


5.8 


318617 AW247252 Hs.75514 


nucleoside phosphorylase 


11.1 


318662 AI285898 


Hs.294014 


ESTs 


16.3 


318691 AW192139 Hs.181307 


H3histone; family 3A 


4 


318740 NJIL002543HS.77729 


EST cluster (not in UniGene) 


21.3 


318744 AI793124 


Hs.144479 


ESTs 


35 


318948 AA317274 


Hs.13996 


ESTs 


11.7 


319163 F15257 


Hs.27 


glycine dehydrogenase (decarboxyiating; 


7 


319478 R06841 


Hs.270307 


EST duster (not In UniGene) 


8.9 


319545 R83716 


Hs.14355 


ESTs 


8.2 


319668 NMJ]02731Hs.87773 


EST duster (not in UniGene) 


25.4 


319763 AA460775 


Hs.6295 


ESTs 


7 


319913 AA179304 


Hs.271586 


ESTs; Moderately similar to !!!! ALU SU 


8.7 


319936 W22152 


Hs.282929 


EST duster (not in UniGene) 


5.6 


319951 AA307665 


Hs.14559 


ESTs 


4.9 


319962 H06350 


Hs.135056 


ESTs 


9.2 


319977 AA632632 




EST duster (not in UniGene) 


4.6 


320074 AA321166 


Hs.278233 


EST duster (not in UniGene) 


16.7 


320092 AF022799 


Hs.1 13292 


cafpain9(nCL4) 


5.4 


320107 AA836461 


Hs.291712 


EST duster (not in UniGene) 


5.3 


320133 D63271 




EST duster (not in UniGene) 


5.5 


320167 AA984373 


Hs.90790 


EST cluster (not in UniGene) 


15 


320187 T99949 


Hs.303428 


EST duster (not in UniGene) 


6.7 


320211 AL039402 


Hs.125783 


DEME-6 protein 


24.3 


320401 U90449 


Hs.152717 


nudeoside diphosphate kinase type 6 (inn 


10 


320458 AI884396 


HS24131 


ESTs 


5.4 


320488 R31386 


Hs.191791 


EST duster (not in UniGene) 


4.9 


320521 N31464 


Hs.24743 


ESTs 


9.5 


320661 AA864846 


Hs.1 15175 


EST duster (not In UniGene) 


6.6 


320691 R61576 


Hs.313951 


hypothetical protein 


5.9 


320699 R63161 


Hs.1 18249 


EST duster (not in UniGene) 


4 


320727 U96044 


Hs.181125 


EST cluster (not in UniGene) 


15.3 


320993 ALD50145 


Hs.225986 


Homo sapiens mRNA; cDNA OKFZp586 


7.2 
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321012 AA737314 Hs.194324 

321050 AW393497 

321051 AF134149 Hs.240395 
321171 AI769410 Hs.221461 
321192 AA295304 Hs.297939 
321354 AA078493 

321387 H68014 Hs.141278 
321412 AW366305 Hs.22891 
321489 AW392474 Hs.172759 
321539 N98619 Hs.42915 
321593 H84762 Hs.253197 
321666 028390 Hs.272897 
321891 AW157424 Hs.165954 
321910 H67065 Hs.271530 
321953 AW068268 Hs.292833 
321978 N77342 Hs.21851 
322017 AA310039 Hs.9192 
322026 AA233527 Hs.283675 
322035 AL137517 Hs.306201 
322171 AF085968 Hs.48474 
322175 AF085975 
322236 AL134970 Hs.104222 
322303 W07459 Hs.157601 
322735 AAD86123 Hs.297856 
322777 AA679082 Hs.269947 
322818 AW043782 Hs.293616 
322882 AW248508 Hs.279727 
322975 C16391 
322991 C18965 Hs.159473 
323011 AA580288 
323091 AW014094 Hs.210761 
323107 AI301107 Hs.150790 

323168 AL120862 Hs.124165 

323195 AI064982 Hs.117950 

323201 AL049370 Hs.13350 

323203 AA203135 Hs.130186 

323243 W44372 Hs.1 10771 

323244 T70731 Hs.193620 
323328 AA228078 Hs.255096 

323332 AI829520 Hs^27513 

323333 AA2288B3 Hs.208558 
323570 AL038623 Hs.208752 
323604 AI751438 Hs.41271 
323685 AA344205 Hs.289088 
323753 M327102 Hs.70266 
323817 AA410943 

323845 AI684674 Hs.41127 
323930 AA570698 Hs.8173 
323997 AA844907 Hs.274454 
324047 AA378201 Hs.271340 
324261 AL044891 Hs.269350 
324302 AA543008 Hs.292471 
324338 AL138357 Hs.145078 
324344 AW502000 Hs.46677 
324432 AA464510 Hs.152812 
324495 AW501411 Hs,122489 
324497 AW152624 Hs.136340 
324598 AA502659 Hs.163986 
324603 AW016378 Hs.292934 
324620 AA448021 Hs.94109 
324727 A1610425 Hs.19597 
324774 AI031771 Hs.132586 
324783 AA640770 Hs.200994 
324824 AI826999 Hs.224624 
324826 AA704806 Hs.143842 
324902 031323 Hs.271492 



EST cluster {not In UniGene) 
EST cluster (not In UniGene) 
EST cluster (not in UniGene) 
ESTs 

ESTs; Weakly similar to neogenin.iH.sap 

EST cluster (not in UniGene) 

ESTs; Weakly similar to till ALU SUBFA 

EST duster (not In UniGene) 

ESTs; Moderately similar to till ALU SU 

ARP2 (actin-related protein 2; yeasty horn 

ESTs 

EST cluster (not In UniGene) 
ESTs 

ESTs; Weakly similar to 111! ALU SUBFA 
ESTs; Weakly similar to l!ll ALU CLASS 
EST duster (not in UniGene) 
ESTs 

low density lipoprotein receptor (familial 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

foffistatin-fikel 

EST duster (not In UniGene) 

EST duster (not in UniGene) 

ESTs 

ESTs 

DiGeorge syndrome critical region gene 2 

EST cluster (not in UniGene) 

ESTs 

EST duster (not in UniGene) 

ESTs 

ESTs 

EST duster (not in UniGene) 
ESTs 

multifunctional polypeptide similar to SA 
Homo sapiens mRNA; cDNA DKFZp586 
ESTs 

EST duster (not in UniGene) 
EST cluster (not in UniGene) 
EST duster (not in UniGene) 
ESTs 

EST duster (not in UniGene) 

ESTs; Weakly similar to III) ALU SUBFA 

ESTs; Weakly simflarto MI ALU SUBFA 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

ESTs; Weakly similar to wadaw [ftmelan 

ESTs 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

ESTs; Weakly simflarto HO ALU SUBFA 

ESTs 

EST cluster (not in UniGene) 

EST duster (not in UniGene) 

ESTs; Weakly similar to 111! ALU CLASS 

ESTs 

ESTs 

ESTs 

EST duster (not in UniGene) 

ESTs 

ESTs 

EST duster (not in UmGene) 

ESTs 

ESTs 

ESTs 
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324961 AA613792 


EST duster (not in UniGene) 


13.3 


324987 T05682 


Hs.172634 


ESTs 


19.6 


324988 T06997 


Hs.121028 


EST duster (not in UniGene) 


24.5 


325146 AJ064690 


Hs.171176 


ESTs 


4.6 


325622 




CH.14Jisgii5867000 


5.2 


326213 




CH.17_hsgiI5867224 


8.1 


326474 




CH.19_hsgil5867405 


12.7 


326816 




CH.20_hs glJ6552458 


9.4 


326817 




CH.20_hsgi|6552458 


11.7 


327110 




CH.2Lhsgii61 17842 


14.7 


327196 




CH.0l_hsgt|5867446 


5.1 


327283 




CH.01 hsgi)5867478 


4.3 


327313 




CH.01 hsgi|5867501 


4.8 


327450 




CH.02JisgiJ5867766 


4.1 


328059 




CH.06 hsgiI6117819 


6.2 


328304 




CK07 hs gi{6004478 


5.4 


328492 




CH.07_hsgi|5868455 


7 


328857 




CH.07 hsgi|6381927 


5.2 


329367 




CHX_hsgi|5868842 


7.6 


329373 




CH.\Jwgil6682537 


12 


329655 




CH.14_p2gi[6448516 


4 


329899 




CH.15_p2gil6563505 


4 


329960 




CH.16j)2gil5091594 


7.6 


330084 




CR19_p2gi|6015302 


4 


330384 M23263 




androgen receptor (dihydrotestosterone re 


5.8 


330385 AA449749 




ESTs; Highly similar to secreted apoptosi 


10.2 


330387 H14624 




ESTs; Highly similar to secreted apoptosi 


4.4 


330388 X03363 




HER2 receptor tyrosine kinase (c-erbB-2; 


17.7 


330409 D50692 


Hs.78221 


c-myc binding protein 


10.1 


330460 TIGR:HT544 


Hs.73946 


Endo 


330486 M13755 


Hs.833 


interferon-stimutated protein; 15 kOa 


67 


330494 M29696 


Hs.237868 


interieukin 7 receptor 


6 


330500 M34423 


Hs.79222 


galactosidase; beta 1 


13.1 


330510 M75099 


Hs.227729 


FK506-btnding protein 2 (13kD) 


29 


330513 M81057 


Hs.180884 


carboxypeptidase B1 (tissue) 


38.5 


330541 U22970 


Hs.265827 


multiple UniGene matches 


7.4 


330542 U23S42 


Hs.226213 


cytochrome P450; 51 (lanosterol 14-alpha 


15 


330547 U32989 


Hs.183671 


tryptophan 2;3-dioxygenase 


11 


330551 U39840 


Hs.299867 


hepatocyte nudear factor 3; alpha 


6.5 


330562 U49082 


Hs.76460 


transporter protein 


7.7 


330573 U62800 


Hs.83393 


cystatin E/M 


4 


330673 D57823 


HS.321403 


Sec23 (S. cerevisiae) homolog A 


10.5 


330711 AA164687 


Hs.177576 


mannosyl (alpha-1;3-)-glycoprotein beta-1 


24.3 


330814 M015730 


Hs.265398 


ESTs; Weafdy similar to transformattorK 


44.1 


330850 AA075298 


Hs.322710 


ESTs 


4.4 


330874 AA127474 


Hs.191157 


ESTs; Weakiy similar to till ALU SUBFA 


8.1 


330884 AA133457 


Hs.102548 


ESTs 


5.2 


330912 M195936 


Hs.82719 


genera! transcription factor IIA; 1 (37kD a 


5 


330924 AA232136 


Hs.159737 


Homo sapiens mRNA; cDNA DKFZp434 


9.1 


330997 H55762 


Hs.9302 


ESTs 


7.6 


331014 H98597 


Hs.30340 


ESTs 


13.5 


331024 N32919 


Hs.27931 


ESTs 


9.1 


331046 N66563 


Hs.191358 


ESTs 


10.5 


331135 R61398 


Hs.4197 


ESTs 


7.4 


331145 R72427 


Hs.129873 


ESTs; Weakly similar to CYTOCHROME 


41.9 


331148 R73816 


Hs.17385 


ESTs 


4.7 


331222 T98531 


Hs.173904 


ESTs 


4.1 


331230 W69807 


Hs.16537 


hypothetical protein; simitar to (U06944) 


4.9 


331306 AA252079 


Hs.63931 


dachshund (DrosophBa) homolog 


15.1 


331327 AA281076 


Hs.109221 


ESTs 


4.8 


331337 AA287662 


Hs.50495 


ESTs 


7.6 


331341 AA303125 


Hs.23240 


ESTs; Weakly similar to ilJl ALU SUBFA 


13 


331344 AA357927 


Hs.126550 


ESTs 


12.4 


331362 AA4 17956 


Hs.40782 


ESTs 


6.5 


331363 AA421562 


Hs.91011 


anterior gradient 2 (Xenepus laevts) homo 


28.2 


331376 AM43802 


Hs.41007 


ESTs; Weakly similar to cDNA EST yk47 


15.1 


331384 AA4560O1 


Hs.93847 


ESTs 


7.9 
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331478 M26608 Hs.40639 ESTs 

331526 N49967 Hs.46624 ESTs 

331533 N51517 Hs.47282 ESTs 

331681 W85712 Hs.119571 collagen; type (II; alpha 1 (Ehlers-Danlos 

5 331686 W88502 Hs.182258 ESTs 

331750 AA284372 Hs.111471 ESTs 

331751 AA284840 Hs.143818 ESTs 

331760 AA292721 Hs.154434 ESTs; Weakly similar to unknown [H.sap 

331763 AA312861 Hs.96704 ESTs 

10 331825 M411144 Hs.292882 ESTs 

331890 AA432166 Hs.3577 succinate dehydrogenase complex; subunl 

331952 AA454756 Hs.97837 ESTs 

332015 AA487910 Hs.208800 ESTs; Weakly simitar toll!! ALU CLASS 

332043 AA490831 Hs.125056 ESTs 

15 332060 AA504779 Hs.191402 ESTs 

332071 AA598594 Hs.205293 ESTs 

332093 AA608794 Hs.112592 ESTs 

332139 AA620669 Hs.112879 EST 

332219 N22508 Hs.139315 ESTs 

20 332225 N33213 Hs.10Q425 ESTs 

332246 N57927 Hs.120777 ESTs; Weakly similar to RNA POLYME 

332247 N58172 Hs.109370 ESTs 
332260 N70088 Hs.13B467 ESTs 

332269 N91279 Hs.109654 ESTs; Moderately similar to outer membr 

25 332336 T96130 Hs.137551 ESTs 

332340 W15495 Hs.129781 chromosome 21 open reading frame 5 

332347 W60326 Hs.288684 ESTs 

332362 W93640 Hs.4779 ESTs; Moderately similar to similar to AD 

332467 AA489630 Hs.119004 KIAA0665 gene product 

30 332499 M12036 Hs.323910 Human tyrosine kinase-type receptor (HE 

332513 AA018182 Hs.154424 defodinase; locbthyranine; type II 

332526 AA281753 Hs.77515 Inositol 1;4;5-triptosphate receptor; type 

332532 N63192 Hs.1892 EST; Highly similar to PHENYLETHAN 

332565 AA234896 Hs.25272 E1A binding protein p300 

35 332607 R41791 Hs.36566 UM domain kinase 1 

332640 AA417152 Hs.5101 protein regulator of cytokinesis 1 

332694 AA262768 Hs.243901 KIAA1067 protein 

332702 H93968 Hs.75725 transgefin2 

332705 T59161 HsJ6293 thymosin; beta 10 

40 332749 AA479968 Hs,88251 arylsulfatase A 

332927 CH22_FGENES.38J 

332929 CH22.FGENES.3B_3 

332930 CH22J r GENES.38_4 
332955 CH22_FGENES.48J2 

45 332958 CH22_FGENES.48J5 

332961 CH22_FGENES.48.18 

332983 CH22.FGENES.54_5 

333009 CH22.FGENES.61_1 

333010 CH22_FGENES.61_2 
50 333013 CH22J=GENES.61J 

333108 CH22_FGENES.79_14 
333139 CH22.FGENES.83.16 
333254 CH22_FGENES.118^2 
333305 CH22.FGENES.137^ 

55 333343 CH22JFGENES.139J2 

333388 "CH22_FGENES.144_3 
333456 CH22JGENES.157..5 
333459 CH22.FGENES.157J 
333517 CH22_FGENES.173_2 

60 333585 CH22.FGENES.203J1 
333679 CH22_FGENES.247_6 
333743 CH22.FGENES.264J 
333758 CH22.FGENES.268J 
333767 CH22_FGENES.271_6 

65 333768 CH22.FGENES.27U 
333769 CH22.FGENES.27U 
333795 Ott^GENES^SJ 



7 

19.8 

6.5 

13.8 

9.9 

5.6 

5.8 

7.4 

7.8 

15.2 

24.3 

5 

10.5 

11.4 

13,6 

9.1 

8.8 

9 

7.1 

12.2 

15.6 

16.9 

4 

8.2 

7.7 

14.1 

4.4 

16.9 

4.6 

10.4 

5.8 

19 

15.3 

12.3 

11.1 

18.2 

15.2 

4.7 

5.5 

9.8 

17.7 

4.7 

7.4 

5.4 

17.8 

10.6 

4.3 

52 

8.1 

8.5 

5.6 

6.3 

6.8 

11.4 

5.1 

12.7 

4.2 

7.6 

8.2 

5 

4.3 

13.4 

4 

5.6 
12.2 
48.3 
6.1 
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333796 


CH22_FGENES.275_3 


6.8 


333892 


CH22_FGENES.29*_14 


4.4 


333904 


CH2£_FGENES.294_2 


6.5 


333905 


CH2Z_FGENES.294_3 


9.3 


333921 


CH22LFGENES.295J2 


9.6 


333968 


CH22_FGENES.307_4 


15.9 


334102 


CH22_FGENES.327_60 


7.1 


334222 


CH2Z_FGENES.360_3 


6.7 


334223 


CH22J=GENES.360_4 


33.5 


334264 


CH2^FGENES.367_15 


18.5 


334343 


CH22_FGENES.375_25 


6.1 


334360 


CH22_FGENES,378_5 


6.1 


334784 


CH22_FGENES.432_9 


4.8 


334789 


CH22_FGENES.432_14 


5.1 


334794 


CH22_.FGENES.434_2 


7 


334889 


CH22_FGENES.452_3 


12.4 


335004 


CH22_FGENES.472_8 


7.9 


335115 


CH22_FGENES.496J2 


18.8 


335287 


CH22_FGENES.526J1 


4,5 


335342 


CH22_FGENES.536_1 


5.3 


335491 


CH22 FGENES.570.23 


24 


335495 


CH22_FGENES.570_28 


7 


335498 


CH22_FGENES.571_7 


12.2 


335544 


CH22_FGENES.576_5 


8.4 


335610 


CH22_FGENES.583_4 


12.9 


335653 


CH22_FGENES.590_4 


6.7 


335682 


CH2*_FGENES.595_2 


12.1 


335687 


CH22 FGENES.596J? 


13.9 


335755 


CH22_FGENES.604_4 


115 


335782 


CH22J=GENES.609_4 


17.9 


335791 


CH22 FGENES.611_7 


27.3 


335809 


CH22_FGENES.617_6 


19.2 


335822 


CH22_FGENES.619_7 


19.1 


335823 


CH22_FGENES.619_8 


4.5 


335824 


CH22_FGENES.619J1 


40.2 


335825 


CH22.FGENES.619J2 


34.3 


335895 


CH22 FGENES.635J 


10.2 


335917 


v CH22 FGENES.636J3 


6 


335920 


CH22_FGENES.636.16 


8.8 


336035 


CH22 FGENES.678 6 


5.9 


336042 


CH22LFGENES.679.4 


5.8 


336093 


CH22 FGENES.691.2 


11.6 


336096 


CH22_FGENES.691_5 


7.6 


336150 


CH22_FGENES.706_6 


6.3 


336152 


CH22_FGENES.706_9 


10.5 


336416 


CH22 FGENES.823_38 


5 


336444 


CH2a_FGENES.827J0 


4.8 


336449 


CH22_FGENES.829_6 


13.6 


336471 


CH22_FGENES.829J0 


6.9 


336512 


CH22_FGENES.834J 


21.4 


336558 


CH22LFGENES.842JJ 


8.2 


336560 


CH22_FGENES.842_5 


9 


336676 


CH22_FGENES.43-4 


9.4 


336959 


CH22 FGENES.367-1,3 


19 


337968 


CH22_EM:AC005500.GENSCAN.103-2 


13.4 


338008 


CH22_EMAC005500.GENSCAN.127-9 


15.2 


338057 


CH22_EM:ACQ05500.GENSCAN.160-1 


13.9 


338410 


CH22_EM:AC0055Q0.GENSCAN.341-6 


8 


338451 


CH22_EM.AC0Q5500.GENSCAN.3594 


11.6 


338588 


CH22_EM:ACG055Q0.GENSCAN.432-1 


10.3 


338665 


CH22_EM*C005500.GENSCAN.464-2 


4.8 


338689 


CH22_EMAC005500.GENSCAN.4754 


6.7 


338832 


CH22_OJ246D7.GENSCAN.64 


4.8 


338980 


CH22_OA59H18.GENSCAN.2-4 


5.1 


339352 


CH22_BA354l12.GENSCAN.29-7 


6.9 


339373 


CH22LBA232E17.GENSCAN.1^9 


4.3 
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TABLE 13A 

Table 13 A shows the accession numbers for those pkeys lacking unigenelD's for Table 13. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

123619 371681 1 AA602964 AA609200 
20 103207 30635.-4 X72790 
103349 11052_-2 X89059 

110856 19346J4 AA992380 N33063 N21418 H79958 R21911 H79957 

123169 44573V M^^S^mm^mmAmSmi N35320 AI251473 H59397 AW971573 R97276W010^AW^7671 
25 M908598 M251875AI820501 AI82CS32W878^ 

AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL1 34043 AW235363 AA663345 AW008282 AA488964 
M233144AI890387AI95WAI741346AI689^ 

Al7fi?fiflft AA9ftft777 AA488892 A1356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 AA251669 AA251874 
30 AI819225 AW205B62 AI683338 A1858509 AWZ76905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 

AA969759 N75628 N22388 H84729 H60052 T92487 A1022058 M780419 AA551005 W80701 AW613456 AI373032 AJ564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 FO4005 
123533 genbankJ\A608751 AA608751 
116480 genbanK_C14088 C14088 
35 132225 genbankJ\A12898Q AA128980 , 
125154 genbanlLW384l9 W38419 

102919 ffi^^^ 

AA323294 W04965 H38759 AA206622 AA580747 AI541 172 AA381075 AA354229 AW402353 AW405575 AW404021 AW406207 

40 AA075752 AW176066 AA287222 AA195818 T20243 R87945 AA295539 M402533 AA232419 AA224515 AW401583 AA331367 

AW402140 AW249079 R31488 M075757 X05236 AW239490 AA338036 AW239495 AA357262 AA431005 AA306726 R33804 
AA216544 AW275288 AA227044 AL038124 AA243300 C03242 M315615 AL035840 R64336 AA313917 AA018963 M001385 
AA054395 H30840 AW498825 AA086141 AI557324 AA121576 H39128 R77161 AA019688 AA380987 AA348140 M348257 
AW176086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R89640 AA0B5425 AA481708 

45 AA313637 AL039229 H84490 H86153F00656 AA326668 M347304 R65890 H41949 AA339309 AW402002 AW404854 

AA192582 AA1 12802 H09248 N83165 H38367 AA356339 AA455763 R66853 AA294935 H8591 1 AA310414 H93436 N87014 
AA001 186 H83640 AA41 1328 AA317929 C04192 AW406288 U46335 AA323179 AA427649 AI366131 H14328 M197161 
AA379497 AA31 1816 AA017206 AA001137 AA017420 AA012990 AW163775 AA021397 AA295513 AA355248 AA374921 
AW380419 AA345864 AA318058 AA371711 AA363255 AA057094 R88057 AA394045 AW362741 AA479579 AW362789 

50 AW362775 AA223624AW362809AW362746 AW362753AW380412 AA057104 AA295271 X06352 AW362771 X12447 R59553 

R87925AA292864 R83920 F01120AA496740M852551 AL0392Q9 AA093339AA333961 N86416 C03325 C03831 AA194750 
AA359270 AA359460 AW245492 AA339400 M330784 AI908970 N89223 AW361938 A194O063 AA1 34323 AA371741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 AK374079 F291 18 AA852940 F35696 AA345963 
AA079578 AA1 13785 C02989 AA095945 C03145 C05199 AA346024 AA1 90506 AW361659 AI909845 AW374374 AW374382 

55 AW374401 AW374373 AW374370 AI909831 AW374367 AA353658 F01041 C02843 AA375948 AW374414 AA213946 AI525039 

K13744 R31007 AA112044 AA134404 H47935 AW177018 M429768 AA336873 AA1 12875 H46393 AA191267 D59131 
AW406037 AA055244 AA341880 AA179024 AA308537 AW406985 AW32731 1 H30301 AA30O7O5 H43788 AW364149 
AA806213 AA481936 C04941 AW375299 M21190 AA410818 AA250940 AI354547 AA317422 AA250903 A1865497 AA890603 
AA366197 AW498538 M78072 AW406461 C03092 F00308 H56488 AA336320 AW406501 AA354102 AA382942 AA096393 

60 AW376830 AA383446 F01259 AW081388 T94036 AA379643 H43842 A1524063 AA621727 AA379099 AA371417 R6681 1 

H44129 N84794 F01 135 AA477852 AA293062 AW361595 H27194 AI831650 H43253 H24797 A1564680 AA380090 W20057 
AI921586 A1192549 AW090808 H25967 AA918121 A1626060 H20221 M812572 H42178 AA887222 H96000 C03180 F00946 
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C03986 AI318091 AI860172 AA582179 AI633388 AA557193 R68075 F24105 AW518239 W56622 AI625219 AI925243 
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 AA1 59956 
AA610836 AA354298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW514329 AA010455 
AA563928 AI571596 A1128394 W73707 AI423575 AA583809 AA657988 AI950837 AA169782 AA600009 AI885540 AA771884 
AI978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AW070909 A1905695 AA4801 15 AA574051 
AI889185 AA773167 AA331 375 AAO0 1437 AA194324 AA194300 AA558632 AI038538 AA41 1329 AA781570 AI833176 
AA935520 AW074197 AA583063 AW073099 AW001 198 M948025 AA587857 AA191540 AI450085 AA193244 AI538037 
AA515572 AA758587 AI149311 AA508610 AA206409 AA534004 AA994600 AA827543 AI916349 AW245129 AW517804 025663 
AA781985 AA284536 AI819422 H16040 H27531 AM56564 AA845555 AI423596 AA012908 AA889439 AA71631 1 AA968868 
AA320508 AA725731 AA834202 AA935997 AA724815 AA769353 AA594803 F00827 AI342442 AI003519 AI002503 AI347597 
AI040946 AA197162 AA987883 AA292865 AW001944 AI64071 1 AW244044 M456784 F30588 AA290829 H24754 AI978683 
AA483686 AA583939 AA121382 AA833831 AA477102 AA977322 AA666379 F35456 AA993537 AJ749610 AA226934 AA716204 
AW513025 AA628543 AA583705 F25702 AI368748 A1124097 AI880086 AA477513 AI758834 AI690753 M477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427566 F25054 AI569314 AA961665 AI922050 AI759000 AA555236 AA514432 
AA293474 AAQ01129 M826789 AA641390 AA134405 F35585 AA477416 AW193359 AI361315 AA284988 F36340 AI361322 
F26969 AA991922 AA021267 F26973 AI361314 F35891 AI918509 AA250964 AA190992 AA577139 AA865535 M134324 
AW192842 AI224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 AA654007 
AI830942 AA1 13195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921 
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 AA054148 AA857063 M017259 
AA179789 AA088908 H43704 AA194320 F35950 A1880127 F20441 F32878 AA962483 H39094 H56489 H44621 F19390 
AI302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 AAO79530 H28106 AA243301 
AI025737 AA101239 AA088887 AI214910 AA974886 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529AI811549AA665180AA708200F01124F32382AA346220AA627361 F30741 F30010 F28543 AA211715 F20245 
AA331222 F25634 F21996 W28215 F21911 R65793 AI192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 
AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H14286 F27532 AW337864 F2841 1 
H13692 F25651 AA975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27208 F18418 F31459 F18564 F33496 F16376 F29740 F29843 F29904 F29866 F19135 AW276602 T40337 
F24835 F34672 F26474 AI926215 AA464185 F18217 N26193 AL043256 T41197 F33055 F00386 F29500 F341 91 F33297 
AI9372Q7 F22724 F15909 F26232 F18889 AA318627 F29085 AA872104 F17509 F23373 F15660 F17552 F1741 2 F16863 
F34033 F21515 F17364 F18383 F16546 F17561 F17260 AA292000 F15723 T47438 F16798 F18046 F18319 F17978 F17566 
F34230 F33258 F20860 F17998 AI695701 



129982 221.267 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X6101 1 M34024 AA327072 Z14166 Z14167 Z14165 



AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF0B2139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X6591 9 AF062190 
AF062177AF062222AF062115Z47240AF062263AF062261 AF062223 AF062211 Z47238AW401714AW404008AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 111699 AW402665 L09070 L28049 L08090 
AW407B43 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 A1268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 083677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
083676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF1151 17 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 A6021536 AB021519 
Z96957 AF021986 Z92898 Z96956 134164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF02201 1 143085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33Q37 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954AJF1031S4 104343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03B17 AW4Q4978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ0073Z7 AF103115 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 ARJ68670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NMJJ02179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
A1857980 AW368899 AI905833 AW406586 AA482084 A1872299 AA715266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 AA715498 AA295885 T27613 T981 13 



118600 genbanR_N69222 

118952 genbanK_N92966 

120873 genbanMA358015 

113702 genbanleT97307 



N69222 
N92966 
AA358015 
T97307 



115763 genbank_AA421560 AA421560 

124357 genbanleN22401 . N22401 

108733 504187 J AA121022 AA126422 

101544 entre*_M31169 M31169 

124447 genbanleN48000 N48000 

124677 genbankJW1073 R01073 

124777 genbankj*41933 R41933 

119302 genbanK_T25725 T25725 
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103680 entrez Z93784Z93784 
135029 H58818_at H58818 
112253 genbankJ$1818 R51818 
Pkey CAT number Accession 
5 322175 46877 1 AF085975 H53458 H53459 

323011 139750 1 M580288 AA31 5655 AA1 33031 AA377748 
322975 1510563J C16391 C16413 

317210 211994 1 AW881145 AA490718 M35637 AA304575 T06067 AA331991 

323817 233566 1 AA41Q943 AW948953AA334202AA332882 
10 309583 1046029,-2 AW170035 

324961 376239J AA613792 AW182329 T05304 AW858385 

303502 325188 1 BE 1 74240 AA488528 AL042253 

320133 447553 1 BE151746 BE336853 D63271 T94955 AA774994 

311935 174129 1 AA216387T63548 AA228676 
15 321050 502195 1 C05928 AW393497 

319977 345248 1 AA534222AA632632 T81234 

312772 4380 7 AW962299 AA310349 AW962294 H63791 H63751 

321354 116028_-2 AA078493 

336512 CH22_3941FGJ34_7J.INK.OJ 
20 336558 CH22J992FGJ42_3_LINK_DJ 

336560 CH22_3994FG_842_5_LINICDJ 

329367 c_xjs 

329373 c_x_hs 

336676 CH22_4154FG_43_4_ 
25 338008 CH22_6490FG_UNICEMACOO 

338057 CH22_6558FG_UNK_EM*C0O 

329655 c14_p2 

336959 CH22_4764FG_367_13_ 

329899 c15_p2 
30 329960 c16_p2 

338410 CH2^7O67FG__UNK_EM:AC00 

338451 CH22_7124FG__UNK_EM:ACQ0 

338588 CH22_7331FG__L!NK_EM:ACC0 

338665 CH22_7438FG_UMK_EI^AC00 
35 338689 CH2^7464FG__UNK.EM:AC00 

308677 A1761173 

338832 CH22 7678FG_UNK_DJ246D7 
338980 CH22~7859FG_JJNKJ)A59H18 
333009 CH22_233FG_61J_UNK_EM:A 
40 333010 CH22_234FG_61_2_LINK_EMA 
333013 CH22_237FG_61_5_LINK_EM*A 
308981 AI873242 
308995 AI880172 

333108 CH22 336FGJ9_14_L1NK_EM: 
45 333139 CH22_368FG_83_16_L1NK_EM: 

333254 CH22_495FG_118_2_UNK_EM: 

333305 CH22_550FGJ37_2_UNK_EM: 

333343 CH22 589FGJ39J2JLINK.EM 

333388 CH22 634FGJ44_3.UNK.EM: 
50 326213 c17_hs 

333456 CH22 706FGJ57_5JJNK_EM: 

333459 CH22_709FGJ57_8J.INK.EM: 

333517 CH22_773FG_173_2_LINieEM: 

333585 CH22_846FG_203_4_UNK_EM'. 
55 333679 CH22_941FG_247_6_LINK_EM: 

326474 c19_hs 

333743 CH22_1009FG_264_1_LINK_EM 
333758 CH22 1024FGJ68JJJNK.EM 
333767 CH22J034FG_271_6J.INK.EM 
60 333768 CH2*J035FG_271_7_LINK.EM 
333769 CH22_1036FG_271_8_UNK_EM 

333795 CH22 1063FG_275JJJNK_EM 

333796 CH22_1065FG_275 3_L1NK_EM 
335004 CH22_2326FG_472_8JJNK_EM 

65 333892 CH2^1163FG_292_14_UNK_E 
335115 CH22_2447FGj496_2_LINK_£M 
333904 CH22J176FG_294_2_LINK_EM 
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333905 CH22J177FG_294JJJNK_EM 
333921 CH2^1194FGJ96_12_UNK.E 
333968 CH22J245FGJ07JJJNK_EM 
328059 c_6Jis 
5 335287 CH22_2629FG_526_11_LINK_E 

326816 c20_hs 

326817 c20_hs 

335342 CH22.2689FG 536 1 L!NK_EM 
335491 CH22_2843FG_570_23_UNK_E 
10 335495 CH22_2847FG_570_28J_INK_E 
335498 CH22J850FG 571 _7_UNK.EM 
328304 c_7Jis 
305453 AA738110 

335544 CH22_2699FG_576_5J_INieEM 
15 335610 CH22_2969FG_583JLLINICEM 
335653 CH22_3013FG_590_4_LINK_EM 
335682 CH22J043FG_595_2_UNK^EM 
335687 CH2Z.3048FGJ96_2.LINK.EM 
328492 c_7_hs 
20 335755 CH22_3122FG_604_4JJNK_EM 
335782 CH22J151FGJ09JJJNK_EM 
335791 CH22 3160FG 611 7 LINK EM 
335809 CH22.3181FGJ17 6 LINK_EM 
335822 CH22_3195FG_619 7_LlNK_EM 
25 335823 CH22_3196FG_619_8JJNK_EM 

335824 CH22JS197FG 619 11 LINK_E 

335825 CH22JJ198FG 619 12_LINK_E 
335895 CH22_3272FG_635J3J_1NK_EM 
335917 CH22J294FG_636J3_UNK_E 

30 335920 CH22J297FG_636J6JJNK_E 

305898 AA872838 

305913 AA876109 

305950 AA884479 

328857 c_7_hs 
35 330084 c19_p2 

337968 CH22_6419FG_LINK_EM:AC00 

309177 AI951118 

309198 AI955915 

309226 AI969897 
40 339352 CH22J317FG_UNK_BA354I1 

309279 AI990102 

339373 CH22 8348FG_LINK_BA232E1 
325622 c14 hs 

334102 CH22_1380FG_327_60_UNK^E 
45 332927 CH2ZJ48FG 38J_LINK_C20H 

332929 CH22_150FG_38_3_LIN1CC20H 

332930 CH22L151FG_38J_LIN1CC20H 
332955 CH22J79FGJ8J2JJNK.EM: 
332958 CH22_182FG_48J5JJNK_EM: 

50 332961 CH22_185FG_48J8JJNK_EM: 
332983 CH22_207FG_54_5JJNKJEM:A 

334222 CH22_1506FG„360_3_UNK_EM 

334223 CH22J507FG_360J_UNK_EM 
334264 CH22_1551FGJ67_15_UNK_E 

55 327110 c21 hs 

334343 CH22J636FG_375_25JJNJeE 

334360 CH22_1654FG_378_5_LINK_EM 

327196 cjjis 

327283 c 1_hs 
60 327313 c_1_te 

304465 AA421948 

304507 AA456426 

327450 c_2Jk 

304591 AA505702 
65 304601 AA507875 

304659 AA533185 

334784 CH22_2096FG_432_9_UNK_EM 
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334789 CH22J2101FGJ32J4J-INK-E 
334794 CH22_2106FG 434JJ.INK.EM 
336035 CH22 3420FG 678_6JJNKJ)J 
336042 CH22_3427FG_679J_UNK_PJ 
5 336093 CH22J481FG_691_2J.INKJ)J 
336096 CH22_3484FG_691_5JJNKJW 
334889 CH22_2206FG_452_3_UNK_EM 
336150 CH22_3540FG_706_6_UNK„DA 
336152 CH22J543FGJ06_9J_INKJ3A 
10 336416 CH22_3833FGJ23_38JJNK.B 
336444 CH22J864FG_827J0_LINKJ> 
336449 CH22_3870FG 829„6_UNKJ3J 
336471 CH22J894FG_829_30JJNKJ> 
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TABLE 13B 



Table 13B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 13B. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

1 0 Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et ai' refers to the publication 

entitled The DNA sequence of human chromosome 22/ Dunham I. et a)., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Ntjxjsition: Indicates nucleotide positions of predicted exons. 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey Ref 



Strand Nt_positlon 



332955 
332958 
332961 
333139 
333254 
333305 
333388 
333517 
333585 
333679 
333767 
333768 
333769 
333795 
333796 
333892 
333921 
333968 
334102 
334264 
334343 
334794 
334889 
335287 
335491 
335495 
335498 
335653 
335687 
335809 
335822 
335823 
335824 
335825 
336035 
336093 
336096 
336444 



338008 
338057 
338410 
338588 
338665 
338832 



Dunham, I. etaL 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etaJ. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham. 1. etal. 
Dunham, L etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, L etal. Plus 
Dunham, I. etal. Plus 
Dunham, I. etal. 
Dunham, 1. etal. 
Dunham, I. etal. 
Dunham. I. etal 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham. I. et.al. 
Dunham, I. etal. 
Dunham. I. etaL 
Dunham. I. etal. 
Ounham, I. etaL 
Dunham, I. etat 
Dunham, 1. etal. 
Dunham. I. etaL 
Dunham, 1. etaL 
Dunham, I. etal. 
Dunham, I. etal 
Dunham, I. etal 
Dunham, I. etal 
Dunham, 1. etal. 
Dunham, i. etal 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal 
Dunham, I etal. 
Dunham, I. etal 



Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
P\us 
Plus 
Plus 
Plus 



Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 



339352 Dunham, I. etal. Plus 



2508896-2508992 

2516164-2516310 

2521424-2521555 

3369495-3369571 

2521424-2521555 

46303884630645 

49137494913805 

5570729-5570925 

62347784234894 

7068795-7068896 

7694407-7694623 

7695440-7695697 

7696625-7696707 

7807688-7807795 

7808253-7808319 

81568254157001 

8380325-8380441 

8681004-8681241 

9995140-9996373 

13234447-13234544 

13655828-13656307 

16374312-16374458 

19286024-19286515 

22299047-22299299 

24128651-24128827 

24140688-24140872 

24172082-24172161 

25329710-25329802 

25445952-25446064 

26310772-26310909 

26364087-26364196 

26365925-26366004 

26376860-26376942 

26378175-26378268 

29016748-29017410 

29556922-29557002 

29578878-29579047 

34190585-34190718 

13233040-13233126 

7697068-7697236 

8526397-8526522 

19292807-19292916 

22896767-22896920 

24472654-24472853 

27775128-27775290 

29896789-29896874 

33544784-33545121 
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332929 Dunham, L etaL Minus 

332930 Dunham, I. etaL Minus 
332983 Dunham, I. etat. Minus 
333009 Dunham, I. etaL Minus 

5 333010 Dunham, I. etaL Minus 
333013 Dunham, I. etal. Minus 
333108 Dunham, I. etaL Minus 
333343 Dunham, L etal. Minus 
333456 Dunham, L etaL Minus 
10 333459 Dunham, I. etaL Minus 
333743 Dunham, i. etaL Minus 
333758 Dunham, 1. etal. Minus 

333904 Dunham, I. etat Minus 

333905 Dunham, I. etaL Minus 
15 334222 Dunham, I. etaL Minus 

334223 Dunham, I. etaL Minus 
334360 Dunham, I. etaL Minus 
334784 Dunham, I. etaL Minus 
334789 Dunham,!. etaL Minus 

20 335004 Dunham, L etai. Minus 
335115 Dunham,!. etaL Minus 
335342 Dunham, I. etaL Minus 
335544 Dunham, I. etal. Minus 
335610 Dunham, L etaL Minus 

25 335682 Dunham, I. etal. Minus 
335755 Dunham, 1. etaL Minus 
335782 Dunham, I. etaL Minus 
335791 Dunham,). etal. Minus 
335895 Dunham, I. etaL Minus 

30 335917 Dunham,!. etal. Minus 
335920 Dunham, I. et.aL Minus 
336042 Dunham, I. etaL Minus 
335150 Dunham, !. etaL Minus 
336152 Dunham,!. etaL Minus 

35 336416 Dunham, I. etaL Minus 
336449 Dunham, I. etat Minus 
336471 Dunham, I. etal. Minus 
336512 Dunham, 1. etal. Minus 
336558 Dunham,!. etaL Minus 

40 336560 Dunham,!. etaL Minus 
336676 Dunham, I. etaL Minus 
337968 Dunham, t. etal. Minus 
338451 Dunham, I. etal. Minus 
338689 Dunham, I. etal. Minus 

45 339373 Dunham, I. etal. Minus 
325622 5867000 Plus 
329655 6448516 Minus 
329899 6563505 Minus 
329960 5091594 Minus 

5 0 326213 5867224 Minus 
326474 5867405 Plus 
330084 6015302 Minus 

326816 6552458 Plus 

326817 6552458 Plus 
55 327110 6117842 Plus 

327196 5867446 Plus 
327283 5867478 Minus 
327313 5867501 Minus 
327450 5867766 Minus 

60 328059 6117819 Plus 
328492 5868455 Minus 
328304 6004478 Minus 
328857 6381927 Minus 
329367 5868842 Minus 

65 329373 6682537 Minus 



2020758-2020664 

2022565-2022497 

2631933-2631797 

2766043-2765856 

2766207-2766119 

2772278-2772039 

3240494-3240389 

469288W692753 

2631933-2631797 

5144548-5144344 

7573218-7573060 

7666413-7666091 

8217374-6217261 

8217795-8217670 

12732417-12732289 

12734365-12734269 

13728850-13728751 

16294548-16294360 

16306095-16305996 

20581911-20581794 

21388250-21388146 

22597448-22597284 

24650505-24650403 

25068943-25068841 

25421215-25421093 

25763806-25763747 

25908578-25908440 

25948563-25948411 

26975307-26975239 

27028481-27028377 

27034927-27034811 

29041694-29041500 

30150423-30150256 

30156053-30155870 

34047408-34047311 

34204707-34204577 

34215091-34214978 

34278373-34278275 

3437582544375698 

3437681444376596 

2022565-2022497 

7095797-7095680 

20174286-20174193 

24893073-24892972 

33860127-33860047 

69994-70075 

35565-35843 

111058-111783 

1031-1162 

60751-60927 

16995-18101 

57019-59337 

198354-198436 

199909-200001 

94608-94785 

180921-181333 

567-962 

89734-89838 

47928-48076 

37052-37204 

46094-46241 

3884-3952 

8055741051 

87201-87587 

3895049301 
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TABLE 14: Table 2 from BRCA 001-5 US 

Table 14, a subset of table 13, depicts a preferred group of genes highly upregulated in breast 
cancer cells. 



10 



Pkey: 
ExAccn: 
UnigenelD: 
Unigene Title: 
R1: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 



Ratio of tumor to normal breast tissue 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey ExAccn 


UniGene ID 


Unigene Title 


R1 


100038 M97935 




AFFX control: STAT1 


16.7 


100114 D00596 


Hs.82962 


thvmidvlafe svnfrietase 


15.9 


100975 J02923 


Hs.76506 


lymphocyte cytosoKc protein 1 (L-plastin) 


30.1 


101031 J05070 


Hs.151738 


matrix metailoproteinase 9 (gelatlnase B; 9 


37.2 


101104 L07615 


Hs.169266 


Human neuropeptide Y receptor Y1 (NPYY 


18.3 


101143 112723 


Hs.90093 


heat shock 70 kD protein 4 


17;4 


101332 L47276 


Hs.156346 


Homo sapiens (cell line HL-6) alpha topots 


18.9 


101378 M13755 


Hs.833 


interferon-stimulated protein; 15 kDa 


18.1 


101809 M86849 


Hs.323733 


Homo sapiens connexin 26 (GJB2) mRNA 


22.5 


102618 U65932 


Hs.81071 


extracellular matrix protein 1 


23.2 


102721 U79241 


Hs. 118666 


Human clone 23759 mRNA; partial cds 


15 


102817 U90904 


Hs.83724 


Human done 23773 mRNA sequence 


15.2 


102907 X06985 


Hs.202833 


heme oxygenase (decyciing) 1 


22.7 


102985 X17644 


Hs.2707 


G1 to S phase transition 1 


20.6 


103080 X57766 


Hs.155324 


matrix mptaHonmfpinasp 11 fctmmplvsin *? 


17.8 


103180 X69433 


Hs.5337 


Isocitrate dehydrogenase 2 (NADP+); mito 


18^9 


103206 X72755 


Hs.77367 


monokine induced by gamma Interferon 


15.1 


103821 AA157623 


Hs.198793 


K1AA0750 gene product 


23.3 


104115 AA428090 Hs.26102 


ESTs 


28.7 


104667 AA007234 


Hs.30098 


ESTs 


16.6 


105186 AA191512 


Hs.28005 


Homo sapiens mRNA; cDNA DKFZp564G 


19.3 


106103 M421104 


Hs.12094 


ESTs 


15.4 


107151 AA621169 


Hs.8687 


ESTs 


19 


109415 AA227219 


Hs.1 10826 


trinucleotide repeat containing 9 


20.1 


110189 H20543 


Hs.6278 


DKFZP586B1621 protein 


16.6 


110561 H59617 


Hs.5199 


ESTs; Weakly similar to UBIQUITIN-CON 


19.5 


110734 H98714 


Hs.24131 


ESTs 


30.2 


110915 N46252 


Hs.29724 


ESTs 


- 232 


111179 N67239 


Hs.10760 


ESTs 


37 


111357 N91023 


Hs.87128 


ESTs 


15 


112134 R46025 


Hs.7413 


ESTs 


17.4 


113970 W86748 


Hs.8109 


ESTs 


15 


114124 Z38595 


Hs.125019 


ESTs; Highly similar to WAA0886 protein 


22 


114292 Z40715 


Hs.184641 


detta-6 fatty acid desaturase 


19.4 


114901 AA236276 


Hs. 196437 


ESTs; Weakly similar to R26660 1; partial 


16.9 


114965 AA250737 


Hs.72472 


ESTs 


35.1 


115652 AA405098 


Hs.38178 


ESTs 


16.1 


115875 AA433943 


Hs.43946 


ESTs; Weakly similar to Weak similarity t 


33.5 


116790 H29532 


Hs.101174 


rnicratubule-associated protein tau 


22.2 


116921 H72948 


Hs.821 


biglycan 


20.7 


117412 N26722 


Hs.42645 


ESTs 


18.1 


120241 Z41815 


Hs.65946 


ESTs 


15.6 


120325 AA195651 


Hs.104106 


ESTs 


15.2 


121596 AA416740 Hs.174104 


ESTs 


22.6 


123619 AA609200 




ESTs 


23.1 


124006 060302 


Hs.270016 


ESTs 


20.6 


125852 H09290 


Hs.76550 


Homo sapiens mRNA; cONA OKFZp564B 


25.9 


126160 N90960 


Hs.265398 


ESTs; Weakly similar to transformation-^] 
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127677 M916752 Hs.264190 
128595 U31875 Hs.152677 
128717 T30617 Hs.104222 
129124 AA234530 Hs.1O8802 
5 129366 H18027 Hs.184697 
130455 X17059 Hs.155956 
130604 X03635 Hs.1657 
130913 W03592 Hs.21198 
130944 M97935 Hs.21486 

10 131472 M608962 Hs.27258 
131562 U90551 Hs.28777 
132180 AA405569 Hs.418 
132406 F09979 Hs.4774 
132465 AA047896 Hs.49169 

15 132994 AA505133 Hs.279905 
133294 R79723 Hs.69997 
133634 U24166 Hs.234279 
134374 D62633 Hs.8236 
134405 J04177 Hs.82772 

20 134470 X54942 Hs.83758 
134495 D63477 Hs.84087 
134714 U89922 Hs.890 
135237 AA454930 Hs.9691 
301884 AA312082 Hs.105445 

25 302276 NMJ04448HS.323910 
302290 AL1 17607 Hs.175563 
309177 AI951118 
309583 AW170035 
310438 AW022192 Hs.200197 

30 311166 AI821294 Hs.118599 
312153 AA759250 Hs.153028 
313915 AI969390 Hs.163443 
314506 AA833655 Hs.206858 
314558 AI873274 Hs.190721 

35 314691 AW207206 Hs.136319 
314943 AI476797 Hs.184572 
315196 AA972756 Hs.44898 
316177 A1908272 Hs293102 
318073 AW167087 Hs.131562 

40 318662 A1285898 Hs.294014 
318740 NMJ)02543Hs.77729 
318744 AI793124 Hs.144479 
319668 NMJ)02731Hs.87773 
320074 AA321166 Hs.278233 

45 320211 AL039402 Hs.125783 
320727 U96044 Hs.181125 
322818 AW043782 Hs.293616 
322882 AW248508 Hs.279727 
324261 AL044891 Hs.269350 

50 324432 AA464510 Hs.152812 
324603 AW016378 Hs.292934 
324620 AA448021 Hs.94109 
324988 T06997 Hs.121028 
330388 X03363 

5 5 330486 M13755 Hs.833 

330814 AA015730 Hs.265398 
331145 R72427 Hs.129873 
331306 AA252079 Hs.63931 
331890 AA432166 Hs.3577 

60 332526 AA281753 Hs.77515 
332532 N63192 Hs.1892 
332694 AA262768 Hs.243901 
332958 
333769 
65 333968 
334223 
334264 



ESTs; Highly similar to MEM3 (M.muscul 
short-chain alcohol dehydrogenase family m 
Homo sapiens mRNA; cONA DKFZp566L 
N*elhylmaleimide-sensi!ive factor 
plexin C1 

N-aceiyitransferase 1 (arylamine N-acetytt 
estrogen receptor 1 

translocase of outer mitochondrial membra 

signal transducer and activator of transcript 

calcyctin binding protein 

H2A histone family; member L 

fibroblast activation protein; alpha; seprase 

ESTs 

ESTs 

solute carrier family 2 (facilitated glucose t 
zinc finger protein 238 
mlcrotubule-associated protein; RP/EB fam 
ESTs 

collagen; type XI; alpha 1 
CDC28 protein kinase 2 
KIAA0143 protein 

lymphotoxin beta (TNF superfamiTy; memb 
ESTs 

GDNF family receptor alpha 1 

EST cluster {not in UnJGene) with exon hit 

Homo sapiens mRNA; cONA DKFZp564N 

EST singleton (not in UniGene) with exon 

EST 

ESTs 

ESTs 

cytochrome b-561 

ESTs 

ESTs 

ESTs 

ESTs 

ceD division cycle 2; G1 to S and G2 to M 
ESTs 

EST cluster (not in UniGene) 

ESTs 

ESTs 

EST cluster (not in UniGene) 
ESTs 

EST cluster (not in UniGene) 

EST cluster (not in UniGene) 

DEME-6 protein 

EST cluster (not in UniGene) 

ESTs 

DiGeorge syndrome critical region gene 2 
EST cluster {not In UniGene) 
EST cluster (not in UniGene) 
ESTs 

EST cluster (not in UniGene) 
EST cluster (not in UntGene) 
HER2 receptor tyrosine kinase (c-erbB-2; E 
interferon-stimulated protein; 15 kDa 
ESTs; Weakly similar to transformation-rel 
ESTs; Weakly similar to CYTOCHROME 
dachshund (Drosophila) homotog 
succinate dehydrogenase complex; subunit 
inositol 1;4;5-triphosphate receptor; type 3 
EST; Highly similar to PHENYLETHANO 
KJAA1067 protein 
CH22_FGENES.48J5 
CH2*_FGENES.271_8 
CH2£_FGENES.307_4 
CH2i.FGENES.360J 
CH22JGENES.367J5 
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17.3 

27.1 

24.5 

20.7 

18.2 

26.4 

39.9 

20.9 

18.8 

18.1 

18.8 

15.4 

15 

15.4 

26.4 

30.4 

15.2 

15.2 

15.3 

20.3 

16.1 

35.7 

19.5 

20.7 

21.6 

41.4 

24.3 

64.5 

39.1 

24.1 

27.1 

27.1 

27.8 

22.5 

21.4 

18.4 

28.8 

32.6 

15.7 

16.3 

21.3 

35 

25.4 

16.7 

24.3 

15.3 

21 

15.3 

50.1 

16.7 

23.1 

21.2 

24.5 

17.7 

67 

44.1 

41.9 

15.1 

24.3 

19 

15.3 

15.2 

17.8 

48.3 

15.9 

33.5 

18.5 
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335791 CH2^.FGENES.611_7 27.3 

336512 CH23JFGENES.834..7 21.4 

338008 CH2Z.EMACOO550O.GENSCAN.127-9 15.2 
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TABLE 14A 



5 

10 
15 
20 
25 



Table 14A shows the accession numbers for those pkeys lacking unigenelD's for Table 14. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

309583 1046029^-2 AW170035 
336512 CH22_3941FG_834_7JJNIU>J 
338008 CH223W0FG_UNK_EM*CQ0 
333769 CH22_1036FG_271XUNrCEM 
333968 CH22_1245FGJ07_4_UNK_EM 
335791 CH22_3160FG_611JLUNK_EM 
309177 AI951118 

332958 CH2£J82FGJ8J5JJNK^EM: 
334223 CH22_1507FGJ60_4JJNK_EM 
334264 CH22 1551FGJ67J5_LINK_E 
123619 371681 1 AA602964 AA609200 



255 



WO 02/059377 



PCT/US02/02242 



TABLE 14B 



5 Table 14B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 14. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



25 



Ptey: Unique number corresponding to an Eos probesel 

Ref. Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled 'The DNA sequence of human chromosome 22." Dunham i. et al., Nature (1 999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposiGon: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 



332958 Dunham, I. eUl. Plus 

333769 Dunham, I. etaL Plus 

333968 Dunham, I. etaL Plus 

334264 Dunham, I. etaL Plus 

338008 Dunham, I. eta!. Plus 

334223 Dunham, I. etaL Minus 

335791 Dunham, I. etaL Minus 

336512 Dunham, I. etaL Minus 



2516164-2516310 

7696625-7696707 

8681004-8681241 

13234447-13234544 

7697068-7697236 

12734365-12734269 

25948563-25948411 

34278373-34278275 
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TABLE 15: Table 3 from BRCA 001-5 US 



Table 15 shows genes downregulated in breast cancer cells. 



Ptey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

1 0 Unigene Title: Unigene gene title 

R1 : Ratio of normal breast tissue to tumor 



Pkey ExAccn UniGene IDUnigene Tide 



R1 



15 

100115 000632 Hs.172153 glutathione peroxidase 3 (plasma) 1.7 

100499 TIGR:HT1428 Hs.283108 Globin.Beta 1.5 

100502 T1GR:HT1496 Hs.169228 Adrenal-Spedfic Protein Pg2 2.3 

100815 T1GR:HT4268 Hs.9739 L-Glycerol^Phosphate:Nad+Oxidoreduct 1.7 

20 101125 110373 Hs.82749 transmembrane 4 superfamily member 2 1.5 

101367 M12963 Hs.4 alcohol dehydrogenase 1 (class I); alpha po 2.9 

101397 M15856 Hs.180878 lipoprotein lipase 1.6 

101883 M98399 HsJ5613 CD36 antigen (collagen type I receptor; thr 1.6 

102227 U25138 Hs.93841 potassium large conductance calciunvactiv 1.6 

25 102857 X00129 Hs.76461 retinol-binding protein 4; interstitial 3 

103211 X73079 Hs.288579 polymeric immunoglobulin receptor 1.8 

103496 Y09267 Hs.132821 flavin containing monooxygenase 2 1.5 

103562 Z21966 Hs.2815 POU domain; class 6; transaction factor 1 1.8 

104672 AA007629 glycerol-3-phosphate dehydrogenase 1 (sol 2.4 

30 105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-BIND 1.7 

4fl£400 AK40AC4Ct U~<CWO CCTo 

106075 AA417915 Hs.25930 ESTs 1.5 

106870 AA487576 Hs.26530 serum deprivation response (phosphatidyls 1.6 

107099 AA609645 Hs.21 1568 eukaryotic translation initiation factor 4 gam 2.7 

35 107616 AA004901 Hs.261164 ESTs 1.6 

107997 AA037388 Hs.82223 Human ONA sequence from clone 141H5 o 1.7 

108604 AA099820 Hs.49696 ESTs 2.4 

111130 N64265 Hs.19515 yz44h12.s1 Morton Fetal Cochlea Homo sa 1.7 

111837 R36447 Hs.24453 ESTs 1.6 

40 112538 R70255 ESTs 1.9 

112808 R97970 Hs.281022 EST 1.5 

113086 T40652 Hs.209100 DKFZP434C171 protein 19 

115740 AA418033 Hs.283559 ESTs 1.6 

115949 AA443800 Hs.43125 ESTs 2 

45 115965 AA446661 Hs.173233 ESTs 2.2 

117224 N20300 Hs.218707 ESTs 1.7 

117513 N32174 Hs.44317 SRY (sex-determining region Y)-box 10 1.7 

119059 R15436 Hs.77889 Friedreich ataxia region gene X1 23 17 

119175 R71792 Hs.301002 ESTs; Weakly similar to cell death acthrato 2.8 

50 1 19359 T71021 . Hs.285681 ESTs; Highly similar to WS basic-helix-loo 1 .9 

119798 W73386 Hs.249129 ESTs 3 

120889 AA365784 Hs.97044 ESTs 1.6 

121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-family pro 1.8 

121750 AA421184 Hs.97549 ESTs 1.5 

55 122127 AA434447 Hs.108771 ESTs 2.5 

122348 AA443695 Hs.293410 ESTs 2.1 

122485 AA448300 Hs.160318 phospholemman 1.5 

123443 AA598841 Hs. 167382 natriuretic peptide receptor A/guanylate cy 1.8 

123505 AA600135 ESTs; Moderately similar to III! ALU SUB 1.5 

60 125284 W94688 Hs.103253 perilipin 1.7 

126300 D81972 HUM427DD8B Human fetal brain (TFujiw 1.8 

126747 R72515 Hs.160318 phospholemman 1.6 

127218 AA309765 Hs.116017 ESTs; Weakly similar to WAA0795 protei 1.5 

127357 AA452788 Hs.75432 zx39g1 1.r1 SoaresJotaUe1usJlb2HF8_.9 1.7 
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127638 AA634405 Hs.122608 ESTs 1.5 
128213 AA972780 Hs.129194 ESTs; Weakly slmfla/ to ALU SUBFA 1.5 
128351 AI092391 Hs.134B86 ESTs 1.5 
128842 N44757 Hs.20340 ESTs 1.6 
128870 R7U03 Hs.75309 eukaryolic translation elonga lion factor 2 1.7 
129146 AA459944 Hs.108924 DKFZP586P 1422 protein 1.5 

129285 T62068 Hs.11006 ESTs 2.1 
129331 N93465 Hs.279772 ESTs; Highly similar to CGl-38 protein [H 1.5 
130085 M62402 Hs.274313 insuEn-Bke growth factor binding protein 6 1.7 

130400 M25079 Hs.283 108 hemoglobin; beta 1.7 

131267 AA211776 Hs.2504 myomesin 1 (skelemin) (185kD) 3.8 

131277 AA131466 Hs.23767 ESTs 1.9 
131282 M12272 Hs.4 alcohol dehydrogenase 3 (class I); gamma p 22 

131304 AA295848 Hs.25475 aquaporin7 1.7 

131810 D49487 Hs.194236 leptci {murine obesity homolog) 2.5 

132788 AA045503 Hs.56874 ESTs; Weakly similar to Homo sapiens p2 1.6 

132931 241452 Hs.6090 deleted in bladder cancer chromosome regl 1.5 

133120 X64559 Hs.65424 tetranectin (plasminogeivtinding protein) 2 

133314 U95367 Hs.70725 gamma-aminobutyric add (GABA) A recep 15 

133507 X74295 Hs.74369 integrin; alpha 7 1.7 

133601 S95936 Hs.284176 transferrin 2.3 

133702 N56898 HsJ5652 glutathione S-transferase M5 1.9 

134111 N79674 Hs.8022 TU3A protein 4.6 

134699 U56614 Hs.88646 deoxyribonuclease Mike 3 1.5 

134749 L10955 Hs.89485 carbonic anhydrase IV 1.6 

135173 M72885. Hs.95910 Human G0S2 protein gene; complete cds 1.9 

300132 AW027556 Hs.156286 ESTs 1.7 

300732 A1369956 Hs.257891 ESTs 1.5 

300750 AA514805 Hs.293055 ESTs 1.8 

301140 AI807692 Hs.129129 ESTs 1.6 

301396 AA923549 Hs.224121 ESTs 2.1 

302910 N77976 Hs.251577 hemoglobin; alpha 1 1.8 

303798 V00505 Hs.36977 hemoglobin; delta 1.6 

303831 T04868 Hs.46780 EST cluster (not in UniGene) with exon hit 1.7 

303844 U94362 Hs,58589 gfycogenin2 1.5 

304182 H91086 EST singleton (not In UniGene) with exon 1.5 

304622 AA516384 EST singleton (not in UniGene) with exon 1.5 

304682 AA550994 EST singleton (not in UniGene) with exon 1.7 

305612 AA782347 Hs.272572 EST singleton (not in UniGene) with exon 1.5 

306193 AA923457 EST singleton (not in UniGene) with exon 1.5 

307206 AM92534 EST singleton (not in UniGene) with exon 1.6 

307377 AI222691 EST singleton (not in UniGene) with exon 1.5 

308023 AI452732 Hs.251577 EST singleton (not in UniGene) with exon 1.9 

308359 A1612774 Hs.79372 retinoid X receptor; beta 1.5 

309838 AW296073 Hs.255504 EST 1.5 

310403 A1720978 Hs.148006 ESTs; Moderately similar to alternatively s 1.8 

311671 AW241947 Hs.232478 ESTs 1.6 

311794 AW238092 Hs.254759 ESTs 2.1 

312082 T79860 Hs.1 18180 ESTs 1.9 

312575 H25237 Hs.306814 ESTs 2.3 

313076 N49684 Hs.143040 ESTs 1.8 

313283 W32480 Hs.157099 ESTs 2.2 

313374 AW328672 Hs.132760 ESTs 1.9 

314701 AI754634 Hs.131987 ESTs 1.7 

315391 AA759098 Hs.192007 ESTs 1.8 

315688 AA680055 Hs.264885 ESTs 1.5 

316249 AA948612 Hs.130414 ESTs 1.6 

316586 AI205077 Hs.294085 ESTs 17 

316890 AA837079 Hs.24647 ESTs 1.5 

316983 AI480204 Hs.177131 ESTs 1.5 

317604 A1650625 Hs.300756 ESTs 1.6 

317951 AW206520 Hs.129621 ESTs 1.5 

319400 W26902 Hs.154085 ESTs 1.7 

320757 H22654 Hs.6382 EST cluster (not in UniGene) 1.5 

321594 AA021402 Hs. 11067 ESTs 17 

322102 H45589 EST cluster (not in UniGene) 1.5 

322814 AI824495 Hs.211038 ESTs 2.2 
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322929 A1365585 Hs.146246 ESTs 2.3 

323831 M335715 Hs.200299 ESTs 1.7 

324044 AL045752 Hs.22350 ESTs W 

324675 AW014734 Hs.157969 ESTs 2.2 

325272 CH.11JB $5866902 1.5 

325558 CH.12Jusgi|6056302 1.6 

325656 CR14_hsgi|6056305 1.6 

326120 CR17Jisgi|5867194 1.5 

326139 CH.17JW #867203 15 

326855 CH.20Jisgi|6552460 1.5 

327438 CH.02_hsgi|6004454 1.6 

329733 CH.14j>2gi|6065783 1.6 

330931 F01443 Hs.284256 ESTs 4.6 

331591 N71677 Hs.42146 ESTs 1.9 

332159 AA621393 Hs.112984 EST 1-5 

332364 W94688 Hs.103253 perifipin 2.1 

332502 H21819 Hs.14896 Homo sapiens done 24590 mRNAsequenc 1.5 

334175 CH22_FGENES.349J0 1.5 

334347 CH22JGENES.375J1 1.8 

334737 CH22.FGENES.424J2 1.8 

335352 CH2*JGENES.539_5 15 

335639 CH22JGENES.584J9 1.6 

336244 CH22JGENES.746.2 1.5 

336336 CH22_FGENES.814_8 1.7 

336865 CH22_FGENES.305-1 1.6 

337494 CH22_FGENES.799-12 1.6 

337764 CH22 EMAC000097.GENSCAN.1 19-1 1.8 

337983 CH22^EMAC005500.GENSCAN.110-1 2 

338192 CH22„EM:AC005500.G£NSCAN.228-1 1.5 

339366 CH22_BA354I12.GENSCAN.34-2 1.5 
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Table 15A shows the accession numbers for those pkeys lacking unigenelD's for Table 15. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eds prabeset identifier number 

CAT number. Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

126300 250375.2 D81972 BE003132 

112538 504579J AA908813 R70255 

123505 genbank_AA600135 AA600135 * 

104672 6735_7 AA349096 AI36B018 F21390 F17759 R48772 AM21485 AI300352 H43971 AI378525 F33552 R47898 AI264177 F22289 N28263 



A1276281 R48205 A1245302 A1190036 A1281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386 



322102 46708 J H45589 H19807 AF075038 H19808 H42437 
336865 CH22J590FG 305_1_ 
338192 CH22__6755FG_JJNK_EMAC00 
329733 c14_p2 
326120 c17Jis 
326139 c17_hs 
326855 c20_hs 

335352 CH22_2699FG_539_5JJNK_EM 

335639 CH22_2999FG 584J9_LINK_E 

307206 AI192534 

307377 AI222691 

337494 CH22_5727FG_799J2_ 

337764 CH22_6115FG_LINK_EM:AC0O 

337983 CH22_6438FG_JJNI^EM:AC0O 

339366 CH22_8336FG_JJNK_BA354I1 

325272 clljis 

325558 c12Jis 

325656 c14_hs 

334175 CH22J455FG_349_10_UNK^E 
304182 H91086 

334347 CH22J640FG_375_31 LINK.E 
327438 c^2Jhs 
304622 AA516384 

334737 CH22J049FG 424 12JJNKJE 
304682 AA550994 

336244 CH22J642FG 746JUINKJ3A 
306193 AA923457 

336336 CK22J746FG 814 8JJNKJA 
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TABLE 15B 

Table 15B shows the genomic positioning for those pkeys lacking unigene DD's and 
5 accession numbers in Table 15. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



25 



30 



35 



entitled -The DNA sequence of human chromosome 22/ Dunham I. et al., Nature (1999) 402:469-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand NLposition 



334347 
334737 
335639 
337494 
334175 
335352 
336244 
336336 
336865 
337764 
337983 
338192 
333366 
325272 
325558 
325656 
329733 
326120 
326139 
326855 
327438 



Dunham, 1. etal. 
Dunham, I. etaL 
Dunham, I. eta!. 
Dunham, I. etal. 
Dunham, 1. etal. 
Dunham, L etal. 
Dunham, I. etal 
Dunham, I. etal. 
Dunham, 1. etal. 
Dunham, I. etal. 
Dunham, L etal. 
Dunham, I. etal. 
Dunham, I. etal. 
5866902 
6056302 
6056305 
6065783 
5867194 
5867203 
6552460 
6004454 



Plus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Plus 

Plus 

Minus 

Minus 

Minus 



13663814-13663926 

15998517-15998685 

25173591-25173696 

33339024-33339148 

11668659-11668597 

22681512-22681384 

3140272941402583 

33797209-33797076 

8622405-8622289 

403564O4035446 

7275495-7275271 

13248453-13248277 

33647431-33647293 

13247-13312 

70930-71030 

78190-78707 

163237-163450 

36116-36276 

218901-218960 

111390-111463 
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TABLE 16: Table 4 from BRCA 001-5 US 

Table 16, a subset of table 15, depicts a preferred group of genes highly downregulated in 
breast cancer cells. 



Pkey. Unique Eos probeset identifier number 

ExAccn: . Exemplar Accession number, Genbank accession number 
10 UnigenelD: Unigene number 
Unigene Title: Unigene gene title 
R1 : Ratio of normal breast tissue to tumor 



15 



20 



25 



30 



35 



40 



45 



Pkey ExAccn 


UniGene fD 


Unigene Titte 


R1 


100502 TIGR:HT1496 


Hs.169228 


Adrenal-Specific Protein Pg2 


2.3 


101367 M12963 


Hs.4 


aicohol dehydrogenase 1 (class I); alpha 


2.9 


102857 X00129 


Hs.76461 


retinol-binding protein 4; interstitial 


3 


104672 AA007629 




glyceroW-phosphate dehydrogenase 1 


2.4 


107099 AA609645 


Hs.211568 


eukaryofic translation initiation factor 4 gam 


2.7 


108604 AA099820 


Hs.49696 


ESTs 


2.4 


115949 AA443800 


Hs.43125 


ESTs 


2 


115965 AA446661 


Hs.173233 


ESTs 


2.2 


119175 R71792 


Hs.301002 


ESTs; Weakly similar to cell death activator 


2.8 


119798 W73386 


Hs.249129 


ESTs 


3 


122127 AA434447 


Hs.106771 


ESTs 


Z5 


122348 AA443695 


Hs.293410 


ESTs 


2.1 


129285 T62068 


Hs.11006 


ESTs 


2.1 


131267 AA211776 


Hs.2504 


myomesin 1 (skelemin) (185kD) 


3.8 


131282 M12272 


Hs.4 


alcohol dehydrogenase 3 (class 1); gamma 


2.2 


131810 D49487 


Hs.194236 


leptin (murine obesity homobg) 


2.5 


133120 X64559 


Hs.65424 


tetranectin (plasminogen-binding protein) 


2 


133601 S95936 


Hs.284176 


transferrin 


2.3 


134111 N79674 


Hs.8022 


TU3A protein 


4.6 


301396 AA923549 


Hs.224121 


ESTs 


2.1 


311794 AW238092 


Hs.254759 


ESTs 


2.1 


312575 H25237 


Hs.306814 


ESTs 


2.3 


313283 W32480 


Hs.157099 


ESTs 


2.2 


322814 AI824495 


Hs.211038 


ESTs 


2.2 


322929 A1365585 


Hs.146246 


ESTs 


2.3 


324675 AW014734 


Hs.157969 


ESTs 


2.2 


330931 F01443 


Hs.284256 


ESTs - 


4.6 


332364 W94688 


Hs.103253 


perilipln 


2.1 


337983 - 




CH22_EM:AC005500.GEN$CAN.110-1 " 


2 
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TABLE 16A 

Table 16A shows the accession numbers for those pkeys lacking unigenelD's for Table 16. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

104672 6735_7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 A1264177 F22289 N28263 
AI276281 R48205 AI245302 AJ190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386 
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TABLE 17: Table 1 from BRCA 014 P 

Table 17 shows accession numbers representing 759 sequences of breast cancer genes or 
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene 
identified, a ratio of the relative amount of expression in breast tumors versus normal breast 
tissue. 



10 



15 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD Unigene Title 



R1 

20 100227 AV654694 Hs.82316 interferon-induced, hepatitis C-associat 3 

100405 AW291587 Hs.82733 nidogen2 3.2 

100406 AI962060 Hs.1 18397 AE-binding protein 1 3.6 
100420 D86983 Hs.1 18893 Melanoma associated gene 3.2 
100911 X83300 Hs.289103 SMA4 5.2 

25 100960 J00124 Hs.1 17729 keratin 14 (epidermolysis bullosa simple 4.3 

101011 BE387036 Hs.1211 add phosphatase 5, tartrate resistant 3 

101183 AA442324 Hs.795 H2Ahistone family, member 0 3.2 

101194 120971 Hs.188 phosphodiesterase 48, cAMP-spedfic (dun 3 

101329 U66042 Hs.82171 Homo sapiens clone 191 B7 placenta expres 4.1 

30 101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 5.3 

101474 R07566 Hs.73817 small Inducible cytokine A3 homologous 3.9 

101491 M25809 Hs.64173 ATPase. H+transportmg, lysosomal (vacu 4.5 

101530 M29874 Hs.1360 cytochrome P450, subfamily 1!B (phenobar 9 

101602 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 3.4 

35 101663 NMJ03528 Hs.2178 H2B histone family, member Q 5.6 

101758 BE019494 Hs.79217 pyrrofine^-carboxylate reductase 1 3.6 

101767 M81057 Hs. 1 80884 carboxypeptidase B1 (tissue) 12 

101817 M89907 Hs.152292 SWI/SNF related, matrix associated, act! 3.2 

101851 BE260964 Hs.82045 midkine (neurite growrflvprornoting factor 4.1 

40 101878 M97815 Hs.1 83650 cellular retinoic acid-binding protein 2 6.5 

102209 NMJ02038 Hs.265827 interferon, atpha-4ndudble protein (do 3 

102214 U23752 Hs.32964 SRY (sex determining region Y)-box 11 3 

102297 NM.001504 Hs,198252 G protein-coupled receptor 9 3,7 

102299 NMJXJ5824 Hs.1 55545 37 kDa leucine-rich repeat (IRR) protein 37 

45 102301 NM005651 Hs.183671 tryptophan 2,3-dioxygenase 5.2 

102305 AL043202 Hs.90073 chromosome segregation 1 (yeast homolog) 3.5 

102369 U39840 Hs. 299867 hepatocyte nudear factor 3. alpha 3.9 

102591 U62325 Hs.324125 amyloid beta (A4) precursor protein-btnd 4 

102721 H16646 Hs. 11 8666 hypothetical protein PP591 3.5 

50 102739 AA363025 Hs.1 55572 Human done 23801 mRNA sequence 3.2 

102791 AF080229 gb:Human endogenous retrovirus K done 1 3 

102804 NMJ02318 Hs.83354 lysyl oxidase-like 2 3.2 

102903 M73779 Hs.250505 retinoic acid receptor, alpha 3.3 

103010 X52509 Hs.161640 tyrosine aminotransferase 12.4 

55 103042 T81656 Hs.252259 ribosomal protein S3 4.5 

103117 X63578 Hs.295449 parvalbumin 3 

103207 X72790 gb:Human endogenous retrovirus mRNA for 5.9 

103282 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.9 

103284 AI751601 Hs.8375 TNF receptor-associated factor 4 3.3 

60 103329 X85134 Hs.72984 retinoblastoma-blnding protein 5 3.1 

103364 X90872 Hs.279929 gp25L2 protein 3 

103385 NM_007069 Hs.37189 similar to rat HREV107 3.4 

103456 AA496425 Hs.9629 papillary renal cell caidnoma (transloc 3.2 
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103498 Y09306 Hs.30148 homeodomain-lnteracGng protein kinase 3 3.4 

103558 BE616547 Hs.2785 keratin 17 3,7 

103563 L02911 Hs.150402 Activln A receptor, type I (ACVR1)(ALK 3.2 

103612 BE336654 Hs.70937 H3 histone family, member A 4.5 

5 103825 AI571835 Hs.55468 ESTs 4 

104073 AW779318 Hs.88417 ESTs 3.8 

104103 AW021102 Hs.21509 ESTs 4.3 

104115 AF183810 Hs.26102 opposite strand to trichominophalangeal 7.6 

.104168 AA461618 Hs.31704 ESTs, Weakly similar to KIAA0227 [H.sapi 3.6 

10 104173 AA084273 Hs.76561 ESTs, WeaWy simitar to S47072 finger pr 4 

104181 AF173296 Hs.283740 DC6 protein 3 

104189 AB040927 Hs.301804 KIAA1494 protein 3.2 

104269 AI559444 Hs.293960 ESTs 4.3 

104307 AI929700 Hs.111680 endosulfine alpha 3.1 

15 104518 H20B16 Hs. 11 2423 Homo sapiens mRNA; cDNA DKFZp5861 1420 (f 3.2 

104556 AV650851 Hs.96900 hypothetical protein; KIAA1830 protein 4.4 

104658 AA360954 Hs.27268 Homo sapiens cDNA: FU21933 fis, done H 3.2 

104748 AA015879 Hs.33536 ESTs 3,2 

104755 T49951 Hs.9029 OKFZP434G032 protein ' 4.5 

20 104825 AA035613 Hs.141883 ESTs 6.9 

104830 AW294092 Hs.21594 hypothetical protein MGC1 5754 11.1 

104865 T79340 Hs.22575 B-ceHCLL/lymphomae, member B (zinc fi 3.5 

104906 BE298684 Hs.26802 protein kinase domains containing protei 6.5 

104961 H78517 Hs.33905 ESTs 3.6 

25 105038 AW503733 Hs.9414 KIM1488 protein 4.5 

105088 H58589 Hs.35156 Homo sapiens cDNAFUl 1027 fis, done PL 3.8 

105092 AA148982 Hs.29068 ESTs 3 

105093 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 4.8 
105304 AW134924 Hs.190325 ESTs 6-2 

30 105397 AA814807 Hs.7395 hypothetical protein FU23182 3.1 

105409 AW505076 Hs.301855 DIGeorae syndrome critical region gene 8 4.2 

105431 AA252033 Hs.242413 hypothetical protein OKFZp434K1421 4.4 

105552 AA256750 Hs.28802 centaurln-alpha 2 protein 3.2 

105598 AA279439 Hs.279763 hypothetical protein FU 10504 3.5 

35 105650 W16741 Hs.25635 HSPC003 protein 37 

105688 AI299139 Hs.17517 ESTs 5.5 

105808 AI133161 Hs.286131 CGl-101 protein 3.5 

105809 AW973653 Hs.20104 hypothetical protein FLJ00052 3.3 
105909 AA195191 Hs.5111 hypothetical protein FLJ20729 3.2 

40 105965 AA131657 Hs.23830 ESTs 3.3 

106135 AL117474 Hs.41181 Homo sapiens mRNA; cDNA OKFZp727C191 (fr 3.2 

106184 W28948 Hs.10762 ESTs 3.3 

106293 N39842 Hs.301444 KIAA1673 4.1 

106400 BE397649 Hs.94109 Homo sapiens cDNAFUl 3634 lis, done PL 3,1 

45 106474 BE383668 Hs.42484 hypothetical protein FU10618 3.2 

106484 AA351978 Hs.4943 hepatocellular carcinoma associated prot 7.8 

106533 AL134708 Hs.145998 ESTs 3 

106614 AA648459 Hs.335951 hypothetical protein AF301 222 3.8 

106636 AW958037 Hs.286 ribosomal protein L4 3.3 

50 106661 AW499914 Hs.7579 hypothetical protein FU 10402 3 

106743 BE613328 Hs.21938 hypothetical protein FU12492 4.2 

106644 AA485055 Hs.158213 sperm associated antigen 6 3.4 

106864 AI311928 gb:qo89h04.x1 NCLCGAP_KkJ5 Homo sapiens 4.4 

106865 AW192535 Hs.19479 ESTs 3.6 
55 106871 AW472981 Hs.321130 hypothetical protein MGC2771 4.1 

106942 AA995351 Hs.31314 retinoblastoma-blnding protein 7 3.6 

106968 AF216751 Hs.26813 CDA14 5.3 

107105 AW963419 Hs.155223 stanniocafdn 2 3.4 

107158 N32849 Hs.31844 hypothetical protein FU12586 3.1 

60 107248 AW263124 Hs.315111 nudear receptor co-repressomHDAC3 comp 5.9 

107265 BE379594 Hs.49135 ESTs, Moderately similar to ALU7JWMAN A 3.9 

107630 AW961576 Hs.60178 ESTs 4.6 

107710 AI955040 Hs.265398 ESTs, Weakly similar to transformation-* 3 

107890 AA025386 Hs.61311 ESTs, Weakly similar to S 10590 cysteine 3.1 

65 107985 T40064 Hs.71968 Homo sapiens mRNA; cDNA DKFZp564F053 (fr 4.8 

108000 A1263307 Hs.239884 H2B histone family, member L 3.3 

108217 AA058686 Hs.62588 ESTs 3.8 
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55 



60 



65 



108435 T82427 
108591 AB033073 
108733 AA121022 
108771 AF068290 
108819 AA011449 
108912 AA136674 
109086 AF186114 
109124 AK000684 
109132 AI970536 
109163 N23235 
109277 AA196443 
109410 AW504732 
109454 AA232255 
109514 AA234087 
109581 R45584 
109632 AA325138 
109644 AW973964 
109700 F09609 
109768 F06838 
109807 R43646 
109842 AW818436 
109695 AK001680 
110024 AW973152 
110561 AA379597 
110675 H89355 
110707 AI239832 
110915 BE092285 
111139 N64683 
111155 N66563 
111199 AI767435 
111336 A1457338 
111510 R07856 
111532 R08440 
111689 AA602004 
111823 R35252 
111876 R38239 

111892 AA421081 

111893 AF070526 
112125 AW379029 
112170 BE246743 
112287 AB033064 
112300 H24334 
112303 R54797 
112478 R66067 
112561 AI791493 
112631 R82040 
112637 R82331 
112657 AW844878 
112678 AI416466 
112917 AA082465 
113070 AB032977 
113095 AA828380 
113117 AW813731 
113187 BE613410 
113200 T57773 
113206 BE262470 
113374 T79925 
113440 U54727 
113494 T91451 
113518 AW367788 
113571 AI702609 
113822 NMJ304585 
113835 AI912410 
113938 W81598 
113947 W84768 
113970 W27249 
114086 AA378776 



Hs.194101 
Hs.43857 

Hs.79741 

Hs.271627 

Hs.118681 

Hs.270737 

Hs.183687 

Hs.16603 

Hs.30567 

Hs.86043 

Hs.21275 

Hs.295232 

Hs.262346 

Hs.23025 

Hs.235873 

Hs.291531 

Hs.14763 

Hs.12422 

Hs.23590 

Hs.30488 

Hs.31050 

Hs.5199 

Hs.249159 

Hs.15617 

Hs.29724 

Hs.290943 

Hs.191358 

Hs.29822 

Hs.29894 

Hs.16355 

HS2326Q 

Hs.24944 

Hs.293246 

Hs.12388 

Hs.13429 

Hs.118338 

Hs.288529 

Hs.334806 

Hs.26125 

Hs.26664 
HS.129B73 

Hs.164599 

Hs.19769 

Hs.33665 

Hs.125031 

Hs.6298 

Hs.126733 

Hs.159153 

Hs.31575 

Hs.10263 

Hs.241471 

Hs.269165 

Hs.191445 

Hs.86536 

Hs.323954 

Hs.15713 

Hs.17466 

Hs.27475 



Homo sapiens cDNA: RJ20869 fis, done A 



Hs.8109 
Hs.288649 



3 

3.3 

gb:zn84f10.r1 Stratagene lung carcinoma 3.9 

hypothetical protein FU10 116 6.1 

ESTs 3.6 

EST 3.9 

tumor necrosis factor (ligand) superfeml 3.7 

hypothetical protein FU22104 3.1 

hypothetical protein FU13163 3.7 

ESTs, Weakly similar to B34087 hypotheS 4.5 

Homo sapiens cONA FU 1 3558 fis, done PL 3.7 

hypothetical protein FU 1 1 01 1 4.6 

ESTs, Moderately similar to A46010 X-lin 6.4 

ESTs, Weakly similar to S72482 hypotheti 4.8 

ESTs, Weakly similar to ALU5.HUMAN ALU S 3.3 

hypothetical protein FU22672 3 

ESTs. Highly similar to 1203217A dehydro 3 

gb:HSC33H092 normalized infant brain cON 3.2 

ESTs 3.2 

ESTs 3.8 

solute carrier family 1 6 (monocarboxyfic 3.3 

DKFZP434F091 protein 3.6 

ESTs 4.2 

HSPC150 protein similar to ubiquifin-con 5.1 

adrenergic, a!pha-2A-, receptor 5.3 

ESTs, Weakly similar to ALU4_HUMAN ALU S 3.7 

hypothetical protein FU13187 3.7 

ESTs 4 

ESTs 3.1 

ESTs 4.5 

ESTs 5.4 

ESTs 3.2 

gbryf 19f09.s 1 Scares fetal liver spleen 3.1 

ESTs 3.2 

ESTs, Weakly similar to 21 09260A B cell 3.3 

ESTs, Weakly similar to putative p150 [H 3.1 

ESTs 3.4 

Homo sapiens done 24787 mRNA sequence 3.3 

ESTs, Weakly similar to unnamed protein 4.4 

hypothetical protein FU22635 7.3 

K1AA1238 protein 3.2 

ESTs 4.4 

gb:yg87b07.s1 Scares infant brain 1NIB H 3.4 

ESTs 8.2 

ESTs. Weakly similar to A36036 cytochrom 5.5 

gb:yj06b06.s1 Soares placenta Nb2HP Homo 3.9 

ESTs 5.4 - 

hypothetical protein MGC4174 3.2 

ESTs 4.7 

chcfine/elhanolamlnephosphotransferase 3.7 

KIAA1151 protein 3.1 

ESTs 3.4 

ESTs. Moderately similar to S65657 alpha 3.4 

SEC63, endoplasmic reticulum translocon 3.2 

ESTs 3.5 

RNB6 6.2 

ESTs, Weakly similar to ALU1 HUMAN ALU S 3.7 

ESTs 3 

ESTs 3.4 

postmeiotic segregation increased 24ike 3.1 

hypothetical protein MGC2776 3.1 

reflnoic add receptor responder (tazaro 3.9 

Homo sapiens cONA FU 12749 fis. done NT 3 

gb:zd88g02.s1 SoaresJetaUu}arLNbHH19W 4.6 

gb:zh53d03.s1 Soares_feta|Jver_spleen_ 3.1 

hypothetical protein FU21080 6.9 

hypothetical protein MGC3077 4.3 
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114148 AW470411 Hs.288433 neurotrimin 4.1 

114424 AW780192 H&267596 ESTs 3.4 

114518 AW163267 Hs.1 06469 suppressor of varl (S.cerevisiae) 3-like 3.1 

114563 AI979168 Hs.82226 glycoprotein (transmembrane) nmb 4.8 

5 114965 AI733881 Hs.72472 BMP-R1B 10.1 

114995 AA769266 Hs.193657 ESTs 3.6 

115121 AI634549 Hs.88155 ESTs 3.2 

115134 AW968073 Hs.194331 ESTs, Highly similar to A557 13 inositol 4.2 

115167 AA749209 Hs.43728 hypothetical protein 3 

10 115253 BE149845 Hs.289038 hypothetical protein MGC41 26 3.6 

115277 AA814100 Hs.86693 ESTs 3.9 

115327 N46436 Hs.109221 ESTs 3.4 

115354 AA281636 Hs.334827 ESTs 4.8 

115657 AA40562Q Hs.55158 ESTs, Weakly similar to T29520 hypotheti 3.5 

15 115676 AA953006 Hs.88143 ESTs 9.3 

115709 AW293849 Hs.58279 ESTs, Weakly similar to ALU7JWMAN ALUS 3.4 

115729 AA417812 Hs.38775 ESTs 4 

115787 AI126772 Hs.40479 ESTs 3.1 

115830 AW970529 Hs.86434 hypothetical protein RJ21 816 3.6 

20 115835 AA521410 Hs.41371 ESTs 3.1 

115850 NM.014937 Hs.52463 KIAA0966 protein 3 

115900 AK001500 Hs.165186 hypothetical protein FU 13852 3.2 

115935 AA354549 Hs.41181 Homo sapiens mRN A: cDNA DKFZp727C 191 (fr 3 

115948 AL042465 Hs.43445 poly(A)-specific ribonudease (deadenyla 3.1 

25 116092 AB041035 Hs.93847 NMJ>16931:Homo sapiens NADPH oxidase 4 { 6.7 

116115 AL042355 Hs.70202 WO repeat domain 10 3.6 

116184 AW450737 Hs.128791 CGI-09 protein 3.1 

116192 AA464976 Hs.62528 ESTs, Moderately similar to A46010 X-Cn 3.3 

116208 AI219083 Hs.42532 ESTs. Moderately similar to ALU8.HUMAN A 3.2 

30 116246 AF265555 Hs.250646 baculovirat IAP repeat-containing 6 3.6 

116443 AW962196 Hs.321264 LBP protein 32 4.1 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 4.1 

116726 AK001114 Hs.53913 hypothetical protein FU10252 8.6 

116845 AA649530 gb^s44fi)5.s1 NCLCGAPJUvl Homo sapiens 3.2 

35 117026 H88256 Hs.50456 ESTs, Moderately similar to ZN75_HUMAN Z 3.5 

117216 AI569804 Hs.42792 ESTs, Weakly similar to 178885 serineAh 3.1 

117296 AL133427 Hs.42506 Homo sapiens mRNA full length insert cDN 3.2 

117403 H84455 Hs.40639 ESTs 47 

117691 AB040959 Hs.93836 DKFZP434N014 protein 3 

40 118229 AW968941 Hs.166254 hypothetical protein DKFZp566H 33 3.3 

118363 AI183838 Hs.48938 hypothetical protein FU21 802 4.3 

118416 N66028 Hs.49105 FKBP^associated protein 3.1 

118470 AW970584 Hs.291033 ESTs 3.4 

118502 AL157488 Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (fr 52 

45 118695 AKDG0465 Hs.50081 KIAA1 199 protein 3.4 

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8JHUMAN A 3.3 

119025 BE003760 Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 19.7 

119036 R95872 Hs. 1 17572 chemokine binding protein 2 3.7 

119063 R16833 Hs.53106 ESTs, Moderately similar to ALU1_HUMAN A 4.1 

50 119075 M10905 Hs.287820 fibronectin 1 3.2 

119620 W47620 Hs.56009 2 , ^ , -oligoadenylate synthetase 3 (100 k 3.3 

119741 AF041853 Hs.43670 kinesin family member 3A ■ 3.1 

119747 AI970797 Hs.64859 ESTs 5 

119754 AL037824 Hs.194695 ras homolog gene family, member I 3.8 

55 119905 AW449064 Hs.119571 collagen, type III, alpha 1 (Ehlers-Oanl 3.1 

120084 W94472 Hs.59529 ESTs, Moderately similar to ALU 1.HUMAN A 8.4 

120241 AA825686 Hs.321176 ESTs, Weakly similar to S65824 reverse t 3.6 

120326 AA196300 Hs.21145 hypothetical protein RG083M05.2 3.2 

120742 AA225084 gb:nc21d06.r1 NCI_CGAP_Pr1 Homo sapiens 3,6 

60 120870 AA357172 Hs.292581 ESTs, Moderately similar to ALU1_HUMAN A 5.8 

120885 AA365515 Hs.301 872 hypothetical protein MGC4840 3 

120970 AA398118 Hs.97579 ESTs, WeaWy similar to A46010 X-linked 37 

121054 AW976570 Hs.97387 ESTs 5.3 

121095 AA320134 Hs.196029 Homo sapiens mRNA for KIAA 1657 protein, 4 

65 121103 AA398936 Hs.97697 EST 3.5 

121 121 AA399371 Hs.189095 similar to SALL1 (sal (DrosophBa)-IIke 6.3 

121337 AW885727 Hs.301570 ESTs 4.7 
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121351 AW206227 Hs-287727 hypothetical protein RJ23132 5 

121611 M31669 Hs.1735 Inhfbin, beta B (activin AB beta potypep 3.6 

121643 AA640987 Hs.193767 ESTs 5.6 

121770 NM_O15902 Hs.278428 progestin induced protein 3.4 

5 122125 AK000492 Hs.98806 hypothetical protein 4.1 

122338 AA443311 Hs.98998 ESTs 3 

122417 AA446965 Hs.112092 ESTs 4.7 

122513 AI767879 Hs.99214 ESTs 3.8 

122544 AW973253 Hs.292689 ESTs 3 

10 122655 AA323296 Hs.97837 Homo sapiens mRNA; cDNA DKFZp547J047 (fr 5.6 

122605 AA526911 Hs.82772 collagen, type XI. alpha 1 3.2 

122851 AW205931 Hs.99598 hypothetical protein MGC5338 8.6 

123105 M487809 Hs.166011 catenin (cadherin^associaled protein), d 3 

, 123111 AA228776 Hs.191721 ESTs 6.9 

15 123249 AA371307 Hs.125056 ESTs fc 3.6 

123273 AA491253 Hs.173611 Empirically selected from AFFX single pr 7 

123385 BE149685 Hs.17767 KIAA1554 protein 3.1 

123419 T66087 Hs.1 12482 Homo sapiens unknown mRNA sequence 3.4 

123485 A1308876 Hs.103849 hypothetical protein DKFZp761D1 12 3.1 

20 123645 AJ675944 Hs.188691 Homo sapiens cDNA FU 12033 fis, clone HE 3.8 

123819 AA580082 Hs.112264 ESTs 4.7 

124012 AA352723 Hs.241471 RNB6 3.8 

124243 H69125 Hs.133525 ESTs 4.1 

124357 N22401 gb:yw37g07,s1 Morton Fetal Cochlea Homo 4.1 

25 124359 N22508 Hs, 13931 5 Homo sapiens cONA: FU21479 fe. done C 3.6 

124567 AW451645 Hs.151504 Homo sapiens cDNA RJ11973 fis, dene HE 3.1 

124911 N34151 Hs. 174195 interferon Induced transmembrane protein 3.5 

124972 R41396 Hs.101774 hypothetical protein FU23045 4.3 

125006 BE065136 Hs.145696 spfclng factor {CC 1.3) 6 

30 125042 T78906 Hs.269432 ESTs, Moderately similar to ALU1_HUMAN A 8.1 

125184 W60326 Hs.288684 Homo sapiens cDNA FU1 1750 fis, clone HE 47 

125243 AW970536 Hs.105413 ESTs 3.1 

125286 AF086534 Hs.187561 ESTs, Moderately similar to ALU1 JiUMAN A 3.3 

125304 AL359573 Hs.124940 GTP-binding protein 3 

35 125330 AW880562 Hs.114574 ESTs 3 

125331 AI422996 Hs.161378 ESTs 3.2 

125685 AI924630 Hs.4943 hepatocellular carcinoma associated prot 3.2 

126257 N99638 gb:za39g11.r1 Soares fetal liver spleen 4 

126474 AW975814 Hs.326714 Homo sapiens done IMAGE:713177, mRNA se 4 

40 126666 AA648886 Hs.151999 ESTs 3.8 

126872 AW450979 gb:Ul-H-BI3-a1a-a-12-0-Ul.s1 NCI_CGAP_Su 3 

127431 AW771958 Hs.175437 ESTs, Moderately similar to PC4259 ferri 3.6 

127980 AA961459 Hs.125644 ESTs 4.1 

127997 AW068311 Hs.311054 Homo sapiens mRNA full length Insert cDN 3.3 

45 128420 AA650274 Hs.41296 fibroneclin leucine rich transmembrane p 4.6 

128609 NM_003616 Hs.102456 survival of motor neuron protein interne 3.9 

128946 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3 3.1 

128955 AA775076 Hs.185807 Homo sapiens, Similar to PRO0478 protein 3.9 

129092 056365 Hs.63525 poly(rC)-binding protein 2 3.3 

50 129270 AA357185 Hs.109918 ras homolog gene family, member H 3.1 

129301 AF182277 Hs.330780 cytochrome P450, subfamily IIS (phenobar 3.9 

129385 AA172106 Hs.1 10950 Rag C protein 6.2 

129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.4 

129629 AK0O0398 Hs.1 1747 hypotheUcal protein FLJ20391 3 

55 129725 X56411 Hs.1219 alcohol dehydrogenase 4 (dass II), pi p 3.2 

1 30069 AI754813 Hs,146428 collagen, type V, alpha 1 5.4 

130092 X03363 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 4.4 

130298 AI347487 Hs.132781 dass I cytokine receptor 4.6 

130382 NMJKJ3450 Hs.1 55204 zinc finger protein 174 5.6 

60 130622 A1582291 Hs.16846 ESTs, Weakly similar to 04HUD1 debrisoqu 3 

130703 R77776 Hs.18103 ESTs . 3.8 

130881 AA809875 Hs.25933 ESTs 4.2 

130954 AB014544 Hs.21572 KIAA0644 gene product 4.7 

131095 A1399653 Hs.22917 ESTs 4.3 

65 131153 H09048 Hs.23606 ESTs 3.8 

131253 R71802 Hs.24853 ESTs 3.5 

131372 AW293399 Hs.144904 nudear receptor co-repressor 1 3.6 
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131507 AI826268 Hs.27769 ESTs, Weakly similar to MCALHUMAN MITOC 

131587 AI695549 Hs.183868 glucuronidase, beta 

131739 AF017986 Hs.31386 secreted frizzled-reiated protein 2 

131795 BE501849 Hs.32317 high-mobility group 20B 

5 131970 D86960 Hs.3610 KIAA0205 gene product 

131986 NM_002314 Hs.36566 LIM domain kinase 1 

132093 AA400091 Hs.39421 ESTs 

132122 AA426202 Hs.40403 Cbp/p300-tnteracfing transacfivator, wit 

132159 D76435 Hs.41 154 7k family member 1 (oddfaired Drosophi 

10 132333 AA192669 Hs.45032 ESTs 

132406 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 

132482 AV660345 Hs.238126 CGI-49 protein 

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 

132624 AA326108 Hs.33829 bHLH protein DEC2 

15 132700 AA319233 . Hs.5521 ESTs 

132725 NM_006276 Hs.184167 splicing factor, arginine/serine-rich 7 

132799 W73311 Hs.1 69407 SAC2 (suppressor of actin mutations 2, y 

132847 T48195 Hs.58189 eukaryotic translation initiation factor 

132857 Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 

20 132936 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear trans! 

133130 A1128606 Hs.6557 zinc finger protein 161 

133142 AW952412 Hs.65874 ESTs, Weakly similar to A40348 Elav/Sex- 

133167 AW162840 Hs.6641 kinesin family member 5C 

133225 AW600291 Hs.6823 hypothetical protein FU10430 

25 133274 AA085191 Hs.6949 hypothetical protein MGC11275 

133275 293241 Hs.239934 CGI-96 protein 

133287 AW797437 Hs.69771 B-factor, properdin 

133376 BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 

133462 AW675064 Hs.73875 fumarylacetoacetate hydrolase (fumarylac 

30 133740 AW162919 Hs.170160 RAB2, member RAS oncogene family-like 

133831 BE274552 Hs,76578 protein inhibitor of activated STAT3 

133976 AI908165 Hs. 169946 GATA-binding protein 3 (T-cell receptor 

i34666 6E391923 Ks.8752 transmembrane protein 4 

134710 AI433797 Hs.8889 serine hydroxymethyltransferase 1 {sohib 

35 134731 089377 Hs.89404 msh (Drosophila) homeo box homotog 2 

134776 J05582 Hs.89603 mucin 1, transmembrane 

135230 AF064804 H&96757 suppressor of Ty (S.cerevisiae) 3 bomofo 

135303 R61253 Hs.98265 KIAA1877 protein 

135400 X78592 Hs.99915 androgen receptor (dihydroiestosterone r 

40 135411 L10333 Hs.99947 reticuton 1 

300089 A1199738 Hs.208275 ESTs, Weakly similar to ALUAJWMAN 
300233 AW614220 Hs.189402 ESTs 
300254 AW183618 Hs.55610 solute carrier family 30 (zinc transport 
300256 AW591433 Hs.298241 Transmembrane protease, serine 3 

45 300376 Z45270 Hs.235873 hypothetical protein RJ22672 
300973 AA572949 Hs.207566 ESTs 
301111 R10799 Hs.191990 ESTs 
301341 AA887801 Hs.208229 G protein-coupled receptor 
301548 A1091631 Hs.203845 two pore potassium channel KT3. 3 

50 301884 AA312082 Hs.1Q5445 GDNF family receptor alpha 1 

301936 U79745 Hs.1 14924 solute carrier family 16 (monocarboxylic 
301976 T97905 gb:ye54c10.r1 Soares fetal liver spleen 

302001 AB020711 Hs.278346 KIAA0904 protein 
302067 BE542706 Hs.222399 CEGP1 protein 

55 302094 AW749321 Hs.6786 ESTs 

302099 AL049670 Hs.1 37576 ribosomal protein L34 pseudogene 1 
302145 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 
302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F1 12 (fr 
302276 AW057736 Hs.32391G HER2 receptor tyrosine kinase (c-erb-b2, 

60 302290 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (i 
302372 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 
302378 AL109712 Hs.296506 Homo sapiens mRNA fuH length Insert cDN 

302384 AI678059 Hs.202676 synaptonemal complex protein 2 

302385 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 
65 302680 AW192334 Hs.38218. ESTs 

302830 AI038997 Hs.132921 ESTs 

302857 AF282265 Hs.44836 inner centromere protein antigens (135kD 
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302892 AW176909 Hs.42346 caldneimjhbMmg protein caJsartin-1 3.4 

302970 W05608 Hs.312679 ESTs, Weakly similar to A49019 dyneln he 5.1 

303271 AA652687 Hs.96151 Human DNA sequence from clone RP5-1103G7 3.7 

303289 AL121460 Hs.272673 hypothetical protein FU20508 4.1 

5 303357 AW006352 Hs.159643 ESTs, Weakly similar to T32554 hypoftetl 4.2 

303540 AA355607 Hs.309490 ESTs, Weakly similar to putative WHSC1 p 4.3 

303563 AA367699 Hs. 10082 potassium intermediate/small conductance 3.3 

303642 AW299459 gbas50d08j(1 NCLCGAP KW1 1 Homo sapien 4.2 

303780 AI424014 Hs.18995 KIAA1 304 protein 3.6 

10 303797 AW629759 gb:hh70e05.y1 NCLCGAP GUI Homo sapiens 4.9 

303852 R53434 Hs.90207 hypothetical protein MGC1 1 1 38 3.7 

304328 M149951 Hs.62112 zinc finger protein 207 3 

304782 AA582081 gb:nn32h08.s1 NCLCGAP Gasi Homo sapiens 4.1 

305913 AA876109 gb:nx24h01.s1 NCLCGAP GC3 Homo sapiens 3 

15 305917 AA876469 gb:oe48b04.s1 NCLCGAP Pr25 Homo sapiens 3.1 

307010 AI140014 gb:qa68J09j(1 Soares.fetaLhearLNbHH19W . 3.5 

307041 AI144243 gb:qb85b12jc1 Soares fetaLhearLNbHH19W 3.9 

308106 A1476803 gbrtj77e12j(1 SoaresJJSF F8 9W OT_PAJ> S 4.3 

308307 AI581398 Hs. 172928 collagen, type I, alpha 1 4.6 

20 308615 AK000142 Hs.101774 hypothetical protein RJ23045 4.4 

309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3 

309328 AW024348 Hs.233191 EST, Weakly similar to A27217 glucose tr 3.2 

309574 AW168083 gb:xg59g0Ax1 NCI_CGAP_Ut4 Homo sapiens 3.1 

309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-8R 57.6 

25 310054 AI199712 Hs.148486 ESTs, WeaWy similar to 1917210A Pro/Arg 4.6 

310098 A1685841 Hs.161354 ESTs 3.6 

310438 AW022192 Hs.200197 ESTs 4.6 

310683 A1939456 Hs.160870 ESTs 3.2 

310727 AK000703 Hs.323822 Homo sapiens mRNA for K1AA1551 protein, 3.6 

30 310781 AI380797 Hs. 158992 ESTs 10.2 

310895 AI955121 Hs.165724 N-acetylgalactosamine^-O^ulfotransfera 3.4 

310955 AI476732 Hs.263912 ESTs 10.9 

311117 A1671439 Hs.196029 Homo sap/ens mRNA forKIAA1657 protein, 3.1 

311166 A1821005 Hs.11B599 ESTs 10.8 

35 311237 AA641098 Hs.208809 ESTs, Moderately similar to ALU1 JHUMAN A 4.3 

311465 AJ758660 Hs.206132 ESTs 4.4 

311587 A1828254 Hs.271019 ESTs, Weakly similar to A47582 B-ceB gr 5.1 

311598 AW023595 Hs.232048 ESTs 5.8 

311774 AA700870 Hs.14304 ESTs 3.3 

40 311785 AI056769 Hs.133512 ESTs 3.9 

311872 R12375 Hs.194600 ESTs 3.3 

311889 AA767342 Hs.1224B3 ESTs, Weakly similar to PSF_HUMAN PT8-AS 3 

311913 AI358522 Hs.270188 ESTs 3 

311923 T60843 Hs.189679 ESTs 5.6 

45 311935 AA216387 gb:nc16b02.s1 NCLCGAP J>r1 Homo sapiens 5.2 

312019 AA373630 Hs.188750 ESTs 3 

312021 AA759263 Hs.14041 ESTs 3.4 

312067 T78968 Hs.14411 ESTs 3.5 

312090 T80177 Hs.1 18064 similar to rat nuclear ubiquitous casein 3.8 

50 312147 AI633744 Hs.195648 ESTs, WeaWy similar to 138022 hypothefi 4.4 

312153 BE261944 Hs.118625 hexokinasel 5.2 

312168 T92251 Hs.198882 ESTs 3.3 

312182 T94344 Hs.326263 ESTs 3.3 

312187 AA700439 Hs.188490 ESTs 3.4 

5 5 312199 AW438602 Hs.191179 ESTs 3.9 

312219 H73505 Hs.117874 ESTs 4 

312226 AA315703 Hs.199993 ESTs, Weakly similar to ALU8_HUMAN III! 4.9 

312299 AA972712 Hs.269737 ESTs 5.7 

312544 AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypotheti 6.3 

60 312638 AW439195 Hs.256880 ESTs, Weakly similar to S65657 alpha-lC- 4.9 

312826 AW291545 Hs.185018 ESTs 4.9 

312837 AW292286 Hs.255058 ESTs 4.4 

312980 AA497043 Hs.1 15685 ESTs 3.1 

313070 A1422023 Hs.161338 ESTs 4.3 

65 313079 N76497 Hs.1787 proteoIlpM protein 1 {PeDzaeus-Merzbac 3.3 

313089 AF026944 Hs.293797 ESTs 5.8 

313096 AW073310 Hs.163533 Homo sapiens cOMA FU14142 fis, done MA 4.5 
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313126 AA746503 Hs.283313 ESTs 10 

313166 AI801098 Hs.151500 ESTs 3.5 

313197 AW979008 Hs.222487 ESTs 3.3 

313280 AW960454 Hs.222B30 ESTs 4.7 

5 313325 AJ420611 Hs.127832 ESTs 3.4 

313328 AW449211 Hs. 1 05445 GDNF family receptor afoha 1 12.4 

313352 AW150945 Hs.144758 ESTs 4.1 

313385 AI032087 Hs.269819 ESTs 3 

313393 AI674685 Hs.200141 ESTs 5.2 

10 313417 AA741151 Hs.137323 ESTs 3.5 

313434 W92070 gb2h48g05.rt SoaresJetaUveLspleen_ 3,7 

313569 A1273419 Hs.135146 hypothetical protein FU13984 3 

313591 AA046309 gb:zf12fQ1.s1 SoaresJetaL_hearLNbHH19W 5.6 

313615 AI540978 Hs.301997 hypothetical protein RJ 13033 3.2 

15 313915 C18863 Hs.1 63443 Homo sapiens cDMA FU1 1576 fis, done HE 263 

AW175896 Hs.65114 keratin 18 3 

AI535895 Hs.221024 ESTs 4.9 

313997 AV657317 Hs.288649 hypothetical protein MGC3077 3.9 

314043 AA827082 Hs.291872 ESTs 3.1 

20 314078 AW129357 Hs.329700 ESTs 8.3 

314097 AA648744 Hs.269493 ESTs 6.6 

314121 A1732083 Hs.187619 ESTs 6.2 

314129 AA228366 Hs.115122 ESTs 4 
314138 AA740616 gb:ny97f11.s1 NCLCGAPJ3CB1 Homo sapiens 5.9 

25 314236 AA743396 Hs.189023 ESTs 3.1 

314244 AL036450 Hs.103238 ESTs 4 

314305 AI2801 12 Hs.125232 Homo sapiens cDNA FU13266 fis, clone OV 8 

314306 A1697901 Hs.192425 ESTs 3.7 
314322 AA907153 Hs.190060 ESTs 3.3 

30 314394 AW961597 Hs. 1308 16 ESTs, Moderately similar to 138022 hypot 42 

314401 A1660412 Hs.234557 ESTs 3.3 

314465 AA602917 Hs.156974 ESTs 4.7 

314506 AA833855 Hs.206868 Homo sapiens cDNAFU 14056 fis, clone HE 8.5 

314510 A1204418 Hs.190080 ESTs 4 

35 314546 AW007211 Hs.16131 hypothetical protein FU12876 3.4 

314547 AA399272 Hs.144341 ESTs 6.7 

314558 A1873274 Hs.190721 ESTs 27.4 

314627 AA425310 Hs.155766 ESTs, Weakly similar to A47582 B-cefl gr 4.4 
314648 AW979268 gb:EST391378 MAGE resequences, MAGP Homo 4.6 

40 314691 AW2Q7206 Hs.136319 ESTs 20.7 

314729 AA457367 Hs.191638 ESTs 3.6 

314754 AW026761 Hs.134374 ESTs 3.6 

314814 BE350122 Hs.157367 ESTs, Weakly similar to I78885 serine/th 4.9 

314864 AW971198 Hs.294068 ESTs 4.3 

45 314881 A1095087 Hs. 152299 ESTs. Moderately similar to S65657 alpha 3.7 

314882 AA828032 Hs,169076 ESTs 3.1 

314981 AW972359 Hs.293334 ESTs 3 

315006 AI538613 Hs.298241 Transmembrane protease, serine 3 10.9 

315021 AA533447 Hs.312989 ESTs 5.3 

50 315051 AW292425 Hs.163484 ESTs 12.9 

315060 AA551104 Hs.189048 ESTs, Moderately similar to ALUC_HUMAN t 5.8 

315073 AW452948 Hs.257631 ESTs 4.2 

315080 AA744550 Hs.136345 ESTs 3.7 

315175 AI025842 Hs.152530 ESTs 6 

55 315183 AW136134 Hs.220277 ESTs 3.9 

315193 A1241331 Hs.131765 ESTs, Moderately similar to I38937 DNA/R 4.4 

315196 AI367347 Hs.44893 Homo sapiens clone TCCCTA001 51 mRNA sequ 8.2 

315198 AI741506 Hs.165900 ESTs, Weakly similar to ALU INHUMAN ALU S 3.6 

315240 R38772 Hs.172619 myelin transcription factor 14ike 3.4 

60 315263 AW510994 Hs.220740 ESTs 3.4 

315282 A1222165 Hs.144923 ESTs 4.9 

315296 AA876905 Hs.125286 ESTs 4 

315368 AB037745 Hs.104696 K1AA1324 protein 4.7 

315397 AA218940 Hs. 137516 fidgetin-like 1 3.1 

65 315489 AI378817 Hs.191847 ESTs 3.1 

315498 AA628539 Hs.116252 ESTs. Moderately similar to ALU 1_HUMAN A 3.2 

315526 AI193043 Hs,128685 ESTs, Weakly similar to T17226 hypotheti 4.1 
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315530 AW015415 


Hs.127780 ESTs 


8.9 


315562 AA737415 


Hs.152826 ESTs 


5.5 


315634 AAS37085 


Hs.220585 ESTs 


6.3 


315647 AA648983 


Hs.212911 ESTs 


3.6 


315707 AI416055 


Hs.161160 ESTs 


5.1 


315772 AW515373 


Hs.271249 Homo sapiens cONA FU13580 fe, clone PL 


3,1 


315850 AW270550 


Hs.116957 ESTs 


3.8 


315858 AA737345 


Hs.294041 ESTs 


5 


315878 AA683336 


Hs.189046 ESTs 


3.1 


315977 AW865916 


Hs.151206 ESTs 


4J 


315978 AA830893 


Hs.119769 ESTs 


4.1 


315995 AI217477 


Hs.194591 ESTs 


4.1 


316012 AA764950 


Hs.119898 ESTs 


7 


316042 AJ469960 


Hs.170698 ESTs 


4.9 


316052 A1962796 


Hs.136754 ESTs 


4.1 


316072 AW517524 


Hs.135201 NOD2 protein 


3,2 


316074 AW975114 


Hs.293273 ESTs 


3.8 


316100 AW203986 


Hs.213003 ESTs 


3.2 


316133 AI187742 


Hs.125562 ESTs 


3.7 


316177 AI904982 


Hs.293102 ESTs. Moderately similar to ALU1.HUMAN A 


30.7 


316186 AI433540 


gb:ti69g05jc1 NCI_CGAP_Kld11 Homosapien 


3.1 


316244 AI640761 


Hs.224988 ESTs 


3.5 


316303 AA740994 


Hs.209609 ESTs 


3.8 


316313 AA741300 


Hs.202599 ESTs, WeaWy similar to 138022 hypothefi 


4.4 


316364 AA747807 


Hs.149500 ESTs 


3.2 


316580 AA938198 


Hs.146123 pory(A) polymerase gamma 


9.4 


316697 AW293174 


Hs.252627 ESTs 


4.4 


316715 A1440266 


Hs.170673 ESTs, Weakly similar to T24832 hypothetl 


3 


316868 AI660898 


Hs.195602 ESTs 


3.2 


316869 A1954880 


Hs.134604 ESTs 


3.2 


316886 AA836331 


Hs.134981 ESTs 


4.4 


316897 AA838114 


Hs.221612 ESTs 


3.7 


316943 AW014875 


Hs.137007 ESTs 


4.6 


317069 AI732892 


Hs.190489 ESTs 


5.9 


317194 AW445167 


Hs.126036 ESTs 


4.1 


317360 AI125252 


Hs.126419 ESTs 


3.5 


317404 AI806867 


Hs.126594 ESTs 


5.1 


317452 AA972965 


Hs.135568 ESTs 


6.9 


317501 A1822034 


Hs.137097 ESTs 


4.6 


317674 AW294909 


Hs.132208 ESTs 


4.3 


317803 AW664964 


Hs.128899 ESTs 


6.1 


317834 X56348 


Hs.287270 ret proto-oncogene (multiple endocrine n 


3.1 


317850 AI681545 


Hs.152982 hypothetical protein FU13117 


3.4 


317881 AI827248 


Hs.224398 Homo sapiens cONA RJ 1 1469 fis, clone HE 


9.6 


317902 AW102941 


Hs.211265 ESTs 


4.1 


317918 AI565071 


Hs.159983 ESTs 


10.3 


318042 AW294522 


Hs.149991 ESTs - 


3.1 


318223 AI077540 


Hs.134090 ESTs 


3.9 


318327 AW294013 


Hs.200942 ESTs 


3 


318332 A1093930 


Hs.163440 Homo sapiens cONA: FU2 1000 fis, done C 


4.4 


318418 AF107493 


Hs.118498 Homo sapiens LUCA-15 protein mRNA,splic 


5.4 


318558 AW402677 


Hs.1 46381 RNA binding motif protein, X chromosome 


4.4 


318625 AA526235 


Hs.193162 Homo sapiens CDNARJ1 1983 fis, done HE 


5.9 


318634 T49598 


Hs,156832 ESTs 


4 


318740 NMJW2543 


Hs.77729 oxidised tow density lipoprotein (lectin 


7.3 


318744 AI793124 


Hs.144479 ESTs 


17.8 


318781 F11802 


Hs.6818 ESTs 


3 


319191 NM»012391 


Hs.794 14 prostate epithelium-specific Ets transcr 


3.6 


319478 AI524124 


Hs.270307 ESTs 


4.6 


319510 W88532 


Hs.254562 ESTs 


3.3 


319551 AA761668 


gb^z24c08.s1 NCLCGAP^GCBI Homo sapiens 


3.2 


319745 T79366 


Hs.108258 actm binding protefri; macrophln (microf 


3.3 


319834 AA071267 


gb.zm61g01 .r1 Stratagene fibroblast (937 


6.2 


319840 C19035 


Hs.164259 ESTs 


3.3 


319977 AA534222 


gb:n)21d02.s1 NCLCGAP.AA1 Homo sapiens 


4.3 


320074 AA321166 


Hs.278233 ESTs 


. 3.4 


320167 AA984373 


Hs.90790 Homo sapiens cDNA: FU22930 fis. ctone K 


4.1 
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320167 T99949 
320211 AL0394Q2 
320416 A1026984 
320588 U78082 
320635 N50617 . 
320654 AI160015 
320742 AI601188 
320832 AA214584 
320915 AI359144 
321016 BE144167 
321107 AI732643 
321171 A176941Q 
321253 AA610549 
321318 AB033041 
321642 A1432199 
321644 AW975944 
321683 AI471598 
321758 U29112 
321811 D80630 
321828 R59890 
321910 H67065 
321937 AL049351 
321978 N77342 
322035 AL137517 
322136 AF075083 
322258 BE265745 
322296 W76326 
322303 A1357412 
322476 AW963372 

322520 T55958 

322521 AF147347 
322567 AF155108 
322595 VV32147 
322675 AA017656 
322766 AW068805 
322818 AW043782 
322882 AW248508 
322975 C16391 
323091 AI902456 
323131 AK002088 
323168 AL120862 
323244 AW675572 
323262 AL133990 

323332 A1829520 

323333 AV651680 
323335 AI655499 
323645 AW445014 
323663 BE081058 
323693 AA317962 
323782 AW961560 
323817 AA410943 
323930 AL043683 
323974 AI825204 
324001 AL044949 
324036 A1472078 
324261 BE069341 
324285 AA431159 
324296 A1524039 
324305 AA642007 
324432 AA464510 
324585 AI823969 
324598 AW972227 
324603 AW993522 
324631 AA937116 
324716 BE169746 
324748 AW974941 
324771 AA631739 



Hs.303426 Homo sapiens cDNA FLJ14832 lis, done OV 
Hs.125783 DEME-6 protein 
Hs.293662 ESTs 

Hs.167738 RNA polymerase )l transcriptional regula 
Hs.80506 smafl nuclear ribonucteoprotein potypept 
Hs.118112 ESTs 
Hs.120910 ESTs 
Hs.290167 ESTs 

Hs.143688 Homo sapiens cDNA: FU23031 lis, done L 
Hs.49994 hypothetical protein similar to RNA-bind 
Hs.144151 ESTs 
Hs.221461 ESTs 
Hs.333239 ESTs 

Hs.137507 vang(vangogh,DrosophflaHike2 
Hs.247084 ESTs 
Hs.237396 ESTs 
Hs.197531 ESTs 
Hs.196151 ESTs 

gb:HUM091D02B Human fetal brain (TFujiwa 
Hs.83623 nudear receptor subfamily 1 , group I, m 
Hs.271 530 ESTs, Weakly similar to ALU7_HUMAN ALU S 
Hs.302058 Homo sapiens mRNA; cDNA DKFZp566C093 (fr 
Hs.21851 Homo sapiens cDNA FL1 12900 fis, done NT 
Hs.334473 hypothetical protein DKFZp5640 1278 

gb:Homo sapiens full length insert cDNA 
Hs.194359 ESTs, Weakly similar to ALUCLHUMAN Illl 

gb:zd60d04.r1 SoaresJetalheart_NbHH19W 
Hs.157601 ESTs 
Hs.46677 PRO2000 protein 

gb:yb35f05.r1 Stratagene fetaJ spleen (9 



Hs.256150 Homo sapiens, Similar to RIKEN cDNA 2810 
Hs.118394 ESTs 

gb:ze39h01.r1 Soares retina N2b4HR Homo 
Hs.288467 Homo sapiens cDNA FU12280 fis, done MA 
Hs.293616 ESTs 

Hs.279727 Homo sapiens cDNA FU14035 fis, done HE 
gb:C16391 Ctontech human aorta polyA mRN 

Hs.210761 ESTs, Weakly similar to I38022 hypotheti 

Hs.270124 Homo sapiens cDNA FU11226 fis, clone PL 

Hs.124165 programmed ceil death 9 (PDCD9) 

Hs.193620 ESTs 

Hs.190642 ESTs 

gbw!19c06jc1 NCLCGAPJJtl Homo sapiens 

Hs.208558 ESTs 

Hs.161712 ESTs 

Hs.197746 ESTs 

Hs.243023 ESTs 

Hs.249721 ESTs. Moderately similar to PC4259 fern 
Hs,97600 ESTs 

BMP-R1B 

Hs.8173 hypothetical protein FU10803 
Hs.211408 ESTs 
Hs.1 16298 ESTs 
Hs.303662 ESTs 

gb:QV3-BT0381-270100-073-c08BT0381 Homo 

Hs.122954 ESTs 
Hs.192524 ESTs 
Hs.116369 ESTs 
Hs.152812 ESTs 
Hs.132678 ESTs 

Hs.163986 Homo sapiens cDMA: FU22765 fis, clone K 
Hs.292934 ESTs 

Hs.293683 ESTs, Weakly similar to I54374 gene NF2 
Hs.12504 likely ortholog of mouse Arkadia 
Hs.292385 ESTs, Weakly similar to I78885 serine/ih 
Hs.335440 EST 
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3.3 

3 

3.9 
3 

11.7 

3.8 

4.4 

%2 

3.1 

4.7 

3.5 

5 

19 

3.6 

3 

4.4 
11.5 
3 
3 

4.2 
4 

5.4 

3.1 

5.2 

7.6 

5.9 

16.5 

4 

3.3 

6.3 

4.6 

10.5 

6.2 

4.3 

9.2 

3.1 

4 

3 

3.2 

8.4 

3.3 

4.5 

4.5 

8.4 
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3 

3 
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324774 
324623 
324824 
324826 
5 324961 
324987 
324994 
325146 
325372 

10 325544 
327075 
332798 
334223 
334447 

15 335809 
335824 
338255 
409430 
428046 

20 432558 



AI031771 

AW516704 

A1826999 

AA704806 

AA613792 

AI375572 

AI805416 

AI064690 



448569 
453542 



AA731602 
BE382657 



25 



30 



35 



40 



45 



50 



55 



60 



65 



R21945 Hs.166975 
AW812795 Hs.155381 
Hs.177269 
Hs.120266 
Hs.21486 
AW836724 Hs.339660 
M97935 
M97935 
M55150 
M13755 
AJ052047 
AA252033 
AA401739 
H18459 
R48744 
M31682 
AA416873 
080240 
R49590 



X03363 



AA034918 



AF049569 
M13955 



HG4126HT4396 



Hs.132586 ESTs 
Hs.208726 ESTs 
Hs.224624 ESTs 

Hs.143842 ESTs, Weakly similar to 2004399A chramos 

gb:no97hQ3.s1 NCLCGAP_Pr2 Homo sapiens 
Hs,172634 ESTs 
Hs^13897 ESTs 
Hs.171176 ESTs 

Phase 2 & 3 Exons 

Phase 2&3 Exons 

Phase 2& 3 Exons 

C22000007^i|12314195|emb|CAB99338.1l (A 
NM_005080*:Homo sapiens X-box binding pr 
NH_012429':Homo sapiens SEC 14 (S. cerevi 
NM_014509*;Homo sapiens kraken-iike (BK1 
ENSP00000249072*:DJ222E13.1 (N-TERMINAL 
N^L014323*:Homo sapiens zinc finger prot 
splicing factor, arginine/sertne-rlch 5 
ESTs, Moderately similar to 138022 hypot 
ESTs 
ESTs 

signal transducer and activator of trans 
Homo sapiens mRNA expressed only in plac 
AFFX controt STAT1 
AFFX control: STAT1 
fumarylacetoacetate 
interferon stimulated protein; 15 kDa 
ESTs 

ESTs; Weakly similar to II!! ALU SUBFAMILY J 
ESTs 

hepatocellular carcinoma associated protein; 
ESTs 

inhibin; beta B (activin AB beta polypeptide) 
ESTs 

HUM5G11A Human fetal brain (TFuJiwara) Homo 
ESTs 

CH22_FGENES.678_5 
CH22.FGENES.619J 
CH22_FGENES.619_12 
CH22_EMAC005500.GENSCAN.127 9 
CH22_EMACO05500.GENSCAN.3O4 2 
CHZLFGENES.271.8 
CH22_FGENES.619_13 
CH22_FGENES.271_7 
CH22_FGENES.617_7 
CH.07_hsgi|6004473 
CH22_FGENES.2S4J 

HER2 receptor tyrosine kinase (c erbB 2; ERBB2; 
CH22_FGENES.617_9 
CH.07_hsgi}5868264 
CH.19_hsgi|5867439 
CH22_FGENES.6 3 
CH.17_hsgil5867230 
CH.20_hsgijS552458 
CH22_EMAC005500.GENSCAN.148 22 
CH22 FGENES.669J0 
KIAA1028 protein 
CH22.FGENES.48J2 
CH22J r GENES.118_2 
ESTs 

multiple UniGene matches 
CH22 FGENES.619.8 
CH22_FGENES.13 7 

CH22_FGENES.360_3 
CH22_FGENES.706_9 
CH.21_hsgi[6531965 
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4.2 
3.4 
3.1 
4.4 
3.9 
18.8 
3.3 
4.2 
4.4 
5.7 
3.8 
4.3 
26.2 
3.9 
10.1 
20 
9 
4 

4.6 
3.2 
3.9 
4.1 
3.7 
3.2 
3 
3 

4.5 
6.7 
3.2 
3.3 
3 

4.2 
3 
3 
4 

3.2 
16.8 
12.9 
11.3 
9.2 
8.5 
8.4 
6 

7.3 
72 
7.1 
6.8 
neu) 
6.5 
5.8 
5.7 
5.3 
5.1 
5.1 
4.7 
4.6 
4.6 
4.5 
4.5 
4.4 
4.3 
4.3 
4.3 

Zinc Finger Protein Hzf4 

4.3 

43 

42 



6.6 



4.3 
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CH.17Jisgi|5867215 
CH22_FGENES.669_8 

CH22J=GENES.48J8 

matrix metalloproteinase 14 (membrane Inserted) 
CH22_FGENES.271_6 
CH22_FGENES.617_3 
CH22_FGENES.290J 

CH22_FGENES.135 
CH22J=GENES.132 
CH.14_hsgi|6682474 
CH.02_hsgi|5867750 
CH22.FGENES.617.8 

CH22_DJ32I10.GENSCAN.23 39 

CH22_FGENES,543_20 

CH22 EMAC005500.GENSCAN.96 1 

CH22JGENES.204J 

CH22_FGENES.619_4 

CH.16_hsgi|5867087 

EST cluster (not in UniGene) 

CH22_EMAC005500.GENSCAN.149 9 

CH22_EM:AC005500.GENSCAN.421 5 

CH22.FGENES.134 

CH.07_hsgi|6004478 

CH22^FGENES.360J 

CH22_FGENE$.6_2 

CH22_C20H12.GENSCAN.16 2 

CH22_C65E1.GENSCAN.8 1 

ESTs; Weakly similar to cis Golgl matrix 

CH22_FGENES.307J 

CH22^EM:AC00550Q.GENSCAN.248 14 

CH.06_hsgi|5902482 

CH22_FGENES.669_5 

CH22JW32I10.GENSCAN.19 8 

CH22_FGENES.527jB 

CH22_FGENES.330J0 

CH22^FGENES.142 

ESTs 

CH22J=GENES.226 7 
CH22_.FGENES.13 3 
OT2_EM:AC0O550O.GENSCAN.2O9 12 
CH22_FGENES.271_3 



4.1 
4.1 

Collagen, Type Viii, Alpha 1 4.1 

4.1 

4 

3.9 
3.9 
3.8 

Guanosine 5 1 Monophosphate Synthase 3.8 

3.8 

3.8 

3.8 

3.8 

3.7 

Oncogene Ret/Ptc2, Fusion Activated 3.7 

3.7 

37 

3.7 

3.5 

3.5 

3.5 

3.4 

3.4 

3.4 

3.3 

3.3 
33 

Dna Binding Protein Ap 2, Alt Splice 3 3.3 

3.3 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3 

3 

3 

3 

3 
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Table 17A shows the accession numbers for those pkeys lacking unigenelD's for Table 17. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



15 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



20 
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Pkey CAT number Accession 

116845 393481.1 AA649530 AA659316 H64973 
103207 30635 -4 X72790 

126257 1 82217 1 N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 

102791 37186J AF080229 AF080231 AF080230 AF080232 AF080233 AFQ80234 BE550633 AI636743 AW614951 BE467547 AI680833 
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587AA463992 AW206802 AI970376 AI583718AI672574 
N25695 AW665466 A1818326 AA1261 28 AW80345 AW013827 AA248638 AI214968 AA204735 AA2071 55 AA206262 AA204833 
AW003247 AW496808 AI080480 AI631703 AI651023 AI857418 AW818140 AA502500 A1206199 AI671282 AI352545 BE50103O 
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE465420 AI624817 BE46661 1 A1206344 AA574397 
AA348354 AI493192 

126872 142696 1 AW4 50979 AA1 36653 AA1 36656 AW4 19381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE01 1359 
BE011367 BE011368BE011362 BE011215 BE011365 BE011363 
1746257 1 R82040 R7O934 
176835.1 AA225084AA302713 

324239J AI311928 AA936030 T51931 AA609816 AA487195M664207 
gentaafiRJW&N F09809 
genbank.R08440 R08440 



112631 
120742 
106864 
109700 
111532 
113938 
113947 
124357 
108733 
112303 
322136 
322296 
321811 
314648 
322520 
322521 
322675 
323332 
316186 
322975 
324261 
323817 
301976 
324961 
303642 
303797 
319551 
311935 
319834 
319977 
314138 
313591 



genbank_W81598 W81598 

genbank W84768 W84768 

genbank_N22401 N22401 

504187 J AA121022AA126422 

genbanU*54797 R54797 

46802 J AF075083 H52291 H52528 

47334J W76326AF086341W72300 

1 527481J D80630 D80896 D80895 

293660.1 AW979268 AA878419 AA431342 AA431628 

38916 J T55958 T57205 AF147346 

38917 J AF147347T55426 T55503 

86787.1 AA017656AA017374AA019761 

179142 1 AI829520 AI791832 AA228414 AI791823 AA229211 AA229315 
425440J AJ433540 AA728984 AA804981 
1510563 1 C16391 C16413 

273265 1 BE069341AW748403AL044891A1908240AA393080 

233566.1 AA410943 AW948953 AA334202 AA332882 

128835.1 T97905AA101672 

376239.1 AA613792 AW182329 T053O4 AW858385 

284260.1 AW299459AA417112 

386364.1 AW629759 AW749955 AA633408 AI651005 

357371 J AA761668 AA573621 R92814 R09670 

174129.1 AA216387 T63548 AA228676 

1 12523.1 AA071267 T65940 T64515 AA071334 

345248 J AA534222 M632632 T81 234 

179960.1 AA740616 AA654854 AA229923 

103087.1 AA046309 AI263500 AA046397 
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308106 AI476803 

338255 CH22_6856FG_LINK^EM:AC00 
335809 CH22J181FGJ17JJ.INK.EM 
335824 CH22J197FG_619J1JJNK_E 
5 307010 AI140014 
307041 AI144243 
305913 AA876109 
305917 AA876469 
309574 AW168083 
10 325372 c12_h$ 
325544 c12_J>s 

332798 CH22 14FGJJJ.1NK.C4G1.G 
334223 CH22ll507FGJ60J_UNK_EM 
327075 c21_hs 
15 334447 CH22J746FGJ87_7_LIMeEM 
304782 AA582081 

313434 441798J W92070AW0 19952 W92053 
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TABLE 17B 



Table 17B shows the genomic positioning for those pkeys lacking unigene ED's and 
5 accession numbers in Table 17. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 Ret" 



15 



20 



25 



Strand: 
NLposition: 

Pkey Ref 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. 'Dunham I. et al." refers to the publication 

entitled 'The DNA sequence of human chromosome 22." Dunham i. et aL, Nature (1999) 402:489495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 



Strand 



334447 Dunham. L etal Plus 

335809 Dunham, I. etaL Plus 

335824 Dunham, I etal. Plus 

332798 Dunham, I. etal Minus 

334223 Dunham, I. etal Minus 

338255 Dunham, I. etal. Minus 

325372 5866920 Minus 

325544 6682452 Plus 

327075 6531965 Plus 



NCposifJon 

14308764-14308824 

26310772-26310909 

26376860-26376942 

232147-231974 

12734365-12734269 

15242294-15242231 

1117061-1117304 

171228-171286 

4041318-4041431 
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TABLE 18: Table 2 from BRCA 014 P 

Table 18 shows genes with atleast five times the expression in breast tumor tissue than is 
5 expressed in normal body tissues. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UnigenelO: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to norma! body tissue 



IS Pkey ExAccn UnigenefD Unigene Title 



101378 
101530 
101767 
20 101878 
103010 
104115 
104825 
107105 
108819 
112287 
112561 
112637 
113206 
113970 
114965 
118925 
119905 
121611 
129301 
133976 
134731 
300254 
301884 
302001 
302067 
302276 
302290 
302372 
45 302385 
309177 
309583 
310781 
311166 
50 311935 
312153 
313328 
313915 
314097 
55 314138 
314506 
314558 
314691 
315006 
60 315021 
315051 
315060 



25 



30 



35 



40 



BE563085 

M29874 

M81057 

M97815 

X52509 

AF183810 

AA035613 

AW963419 

AA011449 

AB033064 

AI791493 

R82331 

BE262470 

W27249 

AI733881 

N92293 

AW449064 

M31669 

AF182277 

A1908165 

D89377 

AW183618 

AA312082 

AB020711 

BE542706 

AW057736 

AA179949 

AL1 17406 

AJ224172 

AI951118 

AW170035 

AI380797 

AI821005 

AA216387 

BE261944 

AW449211 

C18863 

AA648744 

AA740616 

AA833655 

AI873274 

AW207206 

AI538613 

AA533447 

AW292425 

AA551104 



Hs.833 

Hs.1360 

Hs.180884 

Hs.183650 

Hs.161640 

Hs.26102 

Hs.141883 

Hs, 155223 

Hs.271627 

Hs.334806 

Hs.129873 

Hs.164599 

Hs.241471 

Hs.8109 

Hs.72472 

Hs.206832 

Hs.1 19571 

Hs.1735 

Hs.330780 

Hs.169946 

Hs.89404 

Hs.55610 

Hs.105445 

Hs.278346 

Hs.222399 

Hs.323910 

Hs.175563 

Hs.200102 

Hs.204096 

Hs.326736 

Hs.326736 

Hs.158992 

Hs.1 18599 

Hs.1 18625 
Hs.105445 
Hs.163443 
Hs.269493 

Hs.206868 
Hs.190721 
Hs.136319 
Hs.298241 
Hs.312989 
Hs.163484 
Hs.189048 



R1 

5.3 



interferon-stimulated protein, 15 kOa 

cytochrome P450, subfamily ilB (phenobar 9 

carboxypeptidaseBI (tissue) 12 

cellular retinofc acid-binding protein 2 6.5 

tyrosine aminotransferase 12.4 

opposite strand to trichorhinophalangeal 7.6 

ESTs 6.9 

stanniocalcin 2 5.3 

ESTs 6.1 

KIAA1238 protein 7.3 

ESTs, Weakly similar to A36036 cytochrom 8.2 

ESTs 5.4 

RNB6 6.2 

hypothetical protein RJ21080 6.9 

BMP-R1B 10.1 

ESTs, Moderately similar to ALU8JWMAN A 19.7 

collagen, type III, alpha 1 (Ehlers-Danl 8.4 

inhibin. beta B (acGvtn AB beta polypep 5.6 

cytochrome P450, subfamily HB (phenobar 6.2 

GATA-binding protein 3 (T-ceB receptor 6.2 

msh (Drosophila) homeo box homolog 2 5.8 

solute carrier family 30 (zinc transport 9.9 

GDNF family receptor alpha 1 5.7 

KIAA0904 protein 7.7 

CEGP1 protein 7.3 

HER2 receptor tyrosine kinase (c-erb-b2, 5.4 
Homo sapiens mRNA; cDNA DKF2p564N0763 {f 34.1 



ATP-binding cassette transporter MRP8 
tipophiiin B (uteroglobin family member) 
Homo sapiens breast cancer antigen NY-BR 
Homo sapiens breast cancer antigen NY-BR 
ESTs 
ESTs 

gb:nc16bQ2.s1 NCLCGAP_Pr1 Homo sapiens 
hexokinase 1 

GONF family receptor alpha 1 
Homo sapiens cDNA FU1 1576 fis, clone HE 
ESTs 

gb:ny97f11.s1 NCL_CGAP_GCB1 Homo sapiens 5.9 
Homo sapiens cONA FU14056 fis, clone HE 8.5 
ESTs 27.4 
ESTs 20.7 
Transmembrane protease, serine 3 1 0.9 

ESTs 5.3 
ESTs 12-9 
ESTs. Moderatelysimilarto AlUCJiUMAN I 5.8 
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6.7 
13.8 
17.3 
57.6 
10.2 
10.8 
5.2 
5.2 
12.4 
26.3 
6.6 
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10 



15 



20 



25 



30 



35 



40 



315196 
315530 
315634 
316012 
316177 
316580 
317803 
317881 
318740 
318744 
320211 
321107 
321644 
321978 
322035 
322766 
322818 
322975 
323262 
323332 
323817 
324261 
324432 
324598 
324603 
324987 
325544 
330388 
334223 
335809 
335824 



AI367347 Hs.44898 
AW015415 Hs.127780 
AA837085 Hs.220585 
AA764950 Hs.119898 
A1904982 Hs.293102 
AA938198 Hs.146123 
AW664964 Hs. 128899 
AI827248 Hs.224398 
NM_002543Hs.77729 
AI793124 Hs.144479 
AL039402 Hs.125783 
AI732643 Hs.144151 
AW975944 Hs.237396 
N77342 Hs.21851 
AL137517 Hs.334473 
AW068805 Hs.288467 
AW043782 Hs.293616 
C16391 

AL133990 Hs.190642 
AI829520 
M410943 
BE069341 

AA464510 Hs.152812 
AW972227 Hs.163986 
AW993522 Hs.292934 
AI375572 Hs.172634 

Hs.46 



AI052047 
R72427 



Homo sapiens clone TCCCTA00151 mRNA sequ8.2 

ESTs 8.9 

ESTs 6.3 

ESTs 7 

ESTs, Moderated similar to ALU1.HUMAN A 307 

poly(A) polymerase gamma 9.4 

ESTs 6.1 

Homo sapiens cDNA FU1 1469 fis, clone HE 9.6 

oxidised low density lipoprotein (lectin 7.3 

ESTs 17.8 

OEME-6 protein 9.2 

ESTs 12.3 

ESTs 117 

Homo sapiens cONA FU12900 fis, clone NT 5 

hypothetical protein 0KFZp564O1278 1 9 

Homo sapiens cONA FU12280 fis, done MA 5.2 

ESTs 7.6 

gb:C16391 Clontech human aorta polyA mRN 16.5 

ESTs 10,5 

gb^19c06 jc1 NCLCGAPJJtl Homo sapiens 6.2 

BMP-R1B 8.4 
gb:QV3-BT0381-270100-073-c08 BT0381 Homo 49.4 

ESTs 16.5 

Homo sapiens cDNA: FU22765 fis, clone K 5 

ESTs 10.4 

ESTs 18.8 

Phase 2 & 3 Exons 5.7 

HER2 receptor tyrosine kinase (c-erb-b2, 6.6 

NMJ)05080*:Homo sapiens X-box binding pr 26.2 

NMJ)14509*:Homo sapiens kraken-like (BK1 10.1 
ENSP00000249072*:DJ222E13.1 (N-TERMINAL 20 

ESTs; Weakly similar to CYTOCHROME P450 67 

CH22_EM:AC0Q5500.GENSCAN.127 9 5.5 

CH2^FGENES.619J3 9.2 

CH22.FGENES.617J 8 

CH22_FGENES.271_7 6.5 

CH22_FGENES.619J 7.3 

CH22J : GENES.271_8 12.9 

CH22_FGENES.619_12 8.4 

CH22_EMAC005500.GENSCAN.304 2 11.3 

CH.07__hsgij6004473 8.5 

CH22_FGENES.617_7 7.1 

CH22J=GENE$.678J 7.2 

CH22_FGENES.678_5 16.8 
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TABLE 18A 



5 

10 
15 
20 
25 



Table 18A shows the accession numbers for those pkeys lacking unigeneED's for Table 18. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 


CAT number. Gene cluster number 


Accession: Genbank accession numbers 


Pkey CAT number 


Accession 


323332 179142J 


AI829520 AI791832 AA228414 AI791823 AA22921 1 AA229315 


322975 1510563J 


C16391 C16413 


324261 273265J 


BE069341 AW748403 AL044891 AI908240 AA393080 


323817 233566J 


AA410943 AW948953 AA334202 AA332882 


311935 174129J 


AA216387T63548AA228676 


314138 179960J 


AA740616 AA654854 AA229923 


335809 CH22J181FG_617J_UNK_EM 


335824 CH22J197FG_619J1JJNK_E 


325544 c12_hs 




334223 CH22J507FGJ60_4JINICEM 
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TABLE 18B 

Table 18B shows the genomic positioning for those pkeys lacking unigene ID' s and 
5 accession numbers in Table 18, For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

1 0 Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22: Dunham I. et al., Nature (1999) 402:489-495. 
Strand: indicates DNA strand from which exons were predicted. 

NLpositioru Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand NLposition 



335809 OunhanUetaJ. Plus 26310772-26310909 

20 335824 Dunham, I etal. Plus 26376860-26376942 

334223 Dunham, I. etal. Minus 12734365-12734269 

325544 6682452 Plus 171228-171286 
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TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES 



10 



Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues. 
These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip array such 
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or 
equal to 2.5. The "average" breast cancer level was set to the 90 th percentile value. The 
"average" normal adult tissue level was set to the 90 th percentile value amongst 144 non- 
malignant tissues. In order to remove gene-specific background levels of non-specific 
hybridization, the 15 ,h percentile value amongst the 144 non-malignant tissues was subtracted 
from both the numerator and the denominator before the ratio was evaluated. 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 



Rl: 



Ratio of tumor lo normal body tissue 



Pkey ExAccn UnigenelD UnigeneTHIe 



406964 
400291 
407277 
449746 
426878 
400292 
427585 
408045 
407178 
407377 
450705 
407212 
428848 
404561 
407980 
447350 
450375 
422109 
435496 
453160 
420813 
415989 
422505 
424399 
423575 
429441 
431474 
448595 
427217 
402578 
422805 
424634 
456207 
424086 
459587 



AF015224 

M21305 

AA401369 

AW170035 

AI668594 

BE069341 

AA250737 

031152 

AW138959 

AA195651 

C16391 

U90304 

AA412108 

NMJ00230 

AA046309 

AI375572 

AA009647 

S73265 

AW840171 

AI263307 

X51501 

A1267700 

AL120862 

AI905687 

C18863 

AJ224172 

AU33990 

AB014544 

AA399272 

AA436989 

NMJ03613 

AA193450 

A1351010 

AA031956 



Hs.46452 mammagtobin 1 



Hs.190721 
Hs.326736 
Hs.176588 

Hs.72472 
Hs.179729 
Hs.245123 
Hs.104106 

Hs.25351 

Hs.269350 

Hs.194236 



Hs.172634 

Hs.8850 

Hs.1473 

Hs.265398 

Hs.239884 

Hs.99949 

Hs.317584 

Hs.124165 

Hs.2533 

Hs.163443 

Hs.204096 

Hs.190642 

HS.21572 

Hs.144341 

Hs.121017 
Hs,151407 



ab:Human aloha satellite and sateflite 3 
ESTs 

Homo sapiens breast cancer antigen NY-BR 
ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
gb:QV3-BT0381-270100-073-c08BT0381 Homo 
BMP-R1B 

collagen, type X, alpha 1 (Schmid metaph 
ESTs 
ESTs 

gb:C16391 Clontech human aorta polyA mRN 
Iroquois homeobox protein 5 
ESTs 

leptin (murine obesity homotog) 
trichorhinophalangeal syndrome I (TRPS1) 
gb:zf12f01.s1 SoaresJebUwaiUttSHH19W 
ESTs 

a disintegrln and metaltoproteinase doma 
gastrin-reteasing peptide 
ESTs, Weakly similar to transformatlon-r 
H28 histone family, member L 
prolactin-induced protein 
ESTs 

programmed cell death 9 (PDCD9) 
aldehyde dehydrogenase 9 family, member 
Homo sapiens cDNA FLJ11576 fis, done HE 
lipophilin B {uteroglobin family member) 
ESTs 

KIAA0644 gene product 
ESTs 

C1001134:gil2117372|pirl|l65981 fatty ac 
H2A histone family, member A 
cartilage intermediate layer protein, nu 
gb:zr40e07rl SoaresJJhHMPu.SI Homosapi 



Hs.1 02267 lysyl oxidase 



gb:zkl5e04.s1 Scares j)regnant_uterus_NbH 
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R1 

137.6 

71.0 

68.4 

54.2 

46.4 

44.8 

37.4 

32.9 

31.9 

30.4 

27.7 

24.8 

22.0 

21.9 

21.8 

19.8 

17.3 

16.6 

16.5 

16.0 

15.8 

15.8 

15.5 

14.8 

14.5 

13.7 

13.6 

13.5 

13.0 

12.8 

12.6 

122 

12.0 

119 

11.9 

11.5 
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416208 AW291168 Hs.41295 ESTs, Weakly similar to MUC2.HUMAN MUCIN 11.5 

429170 NM_001394 Hs.2359 dual specificity phosphatase 4 11.5 

407276 AI951 1 18 Hs.326736 Homo sapiens breast cancer antigen NY-BR 1 1.4 

434377 AW137148 Hs.306593 Homo sapiens cDNA FU 11 382 fis, clone HE 11.3 

5 448390 AL035414 Hs.21068 hypothetical protein 11.1 

452401 NMJJ07115 Hs.29352 tumor necrosis factor, alpha-induced pro 11.0 

421037 A1684808 Hs.197653 programmed cell death 9 (PDCD9) 10.9 

452461 N78223 Hs. 108 106 transcription factor 10.7 

443348 AW873596 Hs. 182278 calmodu&i 2 (phosphorytase kinase, delt 10.6 

10 421155 H87879 Hs.102267 lysyl oxidase 105 

402606 NM_024626:Homo sapiens hypothetical prot 10.4 

409269 AA576953 Hs.22972 hypothetical protein FU 13352 10.4 

447268 AI370413 Hs.36563 hypothetical protein FU22418 10.3 

447033 A1357412 Hs.157601 ESTs 10.2 

15 400295 W72838 Hs.2533 aldehyde dehydrogenase 9 family, member 10.1 

424905 NNL002497 Hs. 1 53704 Nl MA (never in mitosis gene a)-relatedk 10.1 

432441 AW292425 Hs.163484 ESTs 9.9 

427365 AI873274 Hs.190721 ESTs 9.9 

438950 H23789 Hs.144530 EST 9.8 

20 422835 BE218705 Hs.121378 metaltolhionein-tike 5, testis-specific 9.7 

425692 D90041 Hs. 1 55956 N-acetyttransferase 1 (arylamine N-acety 9.7 

411869 W20027 Hs.23439 ESTs 9.6 

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6 

445730 AI624342 Hs.170042 ESTs 9.5 

25 459583 A1907673 gb:lL-8T152-O8O399-0O4 BT152 Homo sapien 9.3 

452744 AI267652 Hs.30504 Homo sapiens mRNA; cONA DKFZp434E082 (fr 9.2 

432596 AJ224741 Hs.278461 matriiin3 9.1 

400297 AI127076 Hs.334473 hypothetical protein DKFZp56401 278 9.1 

449448 D60730 Hs.57471 ESTs 9.1 

30 423945 AA410943 gb:zt32h03/1 Soares ovary tumor NbHOT H 9.1 

406348 Target Exon 9.0 

424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 9.0 

453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 9.0 

433365 AF026944 Hs.293797 ESTs 8.8 

35 405654 NA C1 2001521 :gil7513934|pirj|T31081 cca3pr 8.8 

418601 AA279490 Hs.86368 calmegin 8.8 

451110 AI955040 Hs.265398 ESTs, Weakly similar to transformations 8.7 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.5 

409041 AB033025 Hs.50061 KIAA1 199 protein 8.4 

40 442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7.HUMAN A 8.3 

451561 N52812 Hs.177403 ESTs 8i 

424001 W67883 Hs. 137476 paternally expressed 10 8.2 

429859 NMJW7050 Hs.225952 protein tyrosine phosphatase, receptor t 8.1 

423887 AL080207 Hs.134585 DKFZP434 6232 protein 8.1 

45 405095 NA TaigetExon 8.1 

419296 AA236115 Hs.120785 ESTs 8.0 

447164 AF026941 Hs.17518 Homo sapiens dg5 mRNA, partial sequence 8.0 

413472 BE242870 Hs.75379 sotute carrier family 1 (glial high affl 8.0 

416747 AW876523 Hs.15929 hypothetical protein FU12910 8.0 

50 415385 R17798 Hs.7535 COBW-Gke protein 7.9 

434424 AI811202 Hs.325335 Homo sapiens cDNA: FU23523 fis, done I 7.9 

420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 7.9 

406687 M31126 Hs.272620 matrix metalloproteinase11 (MMP11;stro 7.8 

400285 NA Eos Control 7.7 

55 437207 T27503 Hs.15929 hypothetical protein FU12910 7.6 

427119 AW880562 Hs.114574 ESTs 7.5 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260AB cell 7.5 

433426 H69125 Hs.133525 ESTs 7.5 

411078 AI222020 Hs.182364 CocoaCrisp 7.4 

60 426214 H59846 Hs.128355 ESTs. Moderately similar to ALU7 J1UMAN A 7.4 

447475 AJ380797 Hs. 158992 ESTs 7.3 

415263 AA948033 Hs.130853 ESTs 7.2 

439569 AW602166 Hs.222399 CEGP1 protein 72 

414142 AW368397 Hs.150042 Homo sapiens cDNA FU14438 fis, clone HE 7.1 

65 426261 AW242243 Hs.168670 peroxisomal famesytated protein 7.0 

444783 AK001468 Hs.62180 anliDn (Drosophila Scraps homolog), act 6.9 

445885 AI734009 Hs. 127699 KIAA 1603 protein 6.9 
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429432 A1678059 Hs.202676 synaptonemal complex protein 2 6.9 

410781 AI375672 Hs.165028 ESTs 6.9 

443788 A1732643 Hs.144151 ESTs 6.9 

421373 AA808229 Hs.167771 ESTs 6.8 

451398 AJ793124 Hs,144479 ESTs 6.8 
404253 NMJ)21Q58 # :Homo sapiens H2B hlstone fami 6.8 

441098 AI015591 Hs.131004 ESTs, Weakly similar to T17227 hypothetl 6.7 

426215 AW963419 Hs.155223 stanniocalcin 2 6.6 

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 6.6 

422956 BE545072 Hs.122579 hypothetical protein FU10461 6.6 

411111 AW818127 gb:CM1-STO277-061299-059-bO7ST0277 Homo 6.6 

434988 A1418055 Hs.161160 ESTs 6.6 

442580 AI733682 Hs.130239 ESTs 6.6 

449611 AI970394 Hs.197075 ESTs 6.6 

408000 111690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 6.5 

420757 X78592 Hs.99915 androgen receptor (dihydrofestosterone r 6.5 

431089 BE041395 Hs.283676 ESTs, WeaWy similar to unknown protein 6.5 

400301 X03635 Hs.1657 estrogen receptor 1 6.5 

427356 AW023432 Hs.97849 ESTs 6.5 

425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 6.4 

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 6.4 

424902 NWL003866 Hs.153687 inositol polyphosphates-phosphatase, ty 6.4 

448693 AW004854 Hs.228320 hypothetical protein FU23537 6.4 

431448 AL137517 Hs.334473 hypothetical protein DKF2p5640l 278 6.2 

444342 IWL014398 Hs.10887 similar to Iysosome-associated membrane 6.1 

422168 AA586894 Hs.1 12408 S1 00 caiaunvWnding protein A7(psorias 6.1 

453331 A1240665 Hs.8895 ESTs 6.1 

418007 M13509 Hs.83169 matrix metalfoproteinase 1 (MMP1; Inters 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

418092 R45154 Hs.106604 ESTs 6.0 

430044 AA464510 Hs.152812 ESTs 5.9 

432837 AA310693 Hs.87329 HSPC072 protein 5.9 

433235 AW375944 Hs.237396 ESTs 5.9 

450701 H39960 Hs.288467 Homo sapiens cDNA FU12280 fis, clone MA 5.9 

425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 5.9 

410785 AW803341 gb:lL2-UM0079-090300-05O-DO3 UM0079 Homo 5.9 

425398 AL049689 Hs.156369 hypothetical protein similar to tenascin 5.9 

414812 X72755 Hs.77367 monokine induced by gamma interferon 5.8 

459371 R20991 gb:yg06h01.r1 Soares infant brain 1NIB H 5.8 

411284 N28519 Hs.135191 ESTs, Weakly similar to unnamed protein 5.8 

453511 AL031224 Hs.33102 transcription factor AP-2 beta (acthrati 5.8 

451807 W52854 Hs.27099 hypothetical protein FU23293 similar to 5.7 

430510 AW162916 Hs.241576 hypothetical protein PR02577 5.7 

415539 AI733881 Hs.72472 BMP-R1B 5.6 

438199 AW016531 Hs.122147 ESTs 5.6 

417866 AW067903 Hs.82772 collagen, type XI, alpha 1 5.5 

430019 AA463893 Hs.220933 ESTs 5.5 

439809 R41396 Hs.101774 hypothetical protein FU23Q4 5 5.5 

423811 AW299598 Hs.50895 homeoboxC4 5.4 

434539 AW748078 Hs.214410 ESTs, Weakly similar to MUC2_HUMAN MUCIN 5.4 

439138 AI742605 Hs.193696 ESTs 5.4 

453931 AL121278 Hs.25144 ESTs 5.4 

444078 8E246919 Hs.10290 U5 snRNP-specific 40 kOa protein (hPrp8- 5.4 

447102 BE167434 Hs.98471 ESTs, Weakly similar to T 187 12 hypotheS 5.4 

451621 AI879148 Hs.26770 fatty acid binding protein 7. brain 5.4 

425236 AW067800 Hs,155223 stanniocalcin 2 5.3 

421464 AA291553 Hs.190086 ESTs 5.3 

450736 AW970060 gb:EST382140 MAGE resequences, MAGK Homo5.3 

428085 AA421081 Hs.12388 ESTs 5.3 

452838 U65011 Hs.30743 preferentially expressed antigen In mela 5.3 

445424 AB026945 Hs.12696 cortacfin SH3 domain-binding protein 5.3 

456938 X52509 Hs.161640 tyrosine aminotransferase 5.3 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.2 

438167 R28363 Hs.24286 ESTs * 5.2 

433330 AW2O7084 Hs.132816 hypothetical protein MGC 14801 5.2 

449765 N92293 Hs.206832 ESTs. Moderately similar to ALU8_HUMAN A 5.2 

416276 O41060 Hs.79136 LIV-1 protein, estrogen regulated 52 
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400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 5.2 

418004 U37519 Hs,87539 aldehyde dehydrogenase 3 family, member 5.2 

439840 AW449211 Hs.105445 GDNF family receptor alpha 1 5.2 

428771 AB028992 Hs. 193 143 K1AA1069 protein 5.2 
455047 AW852530 g bPM1-CT0243-071 099^)01^06 CT0243 Homo 5.2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypoihefi 5.2 

453197 AI916269 Hs.109057 ESTs, Weakly similar to ALU5_HUMAN ALU S 5.1 

400298 AA032279 Hs.61635 six transmembrane epflheflal antigen of 5.1 

431023 AI283133 Hs.297420 ESTs 5.1 

427666 AI791495 Hs.180142 calmodulln-like skin protein 5.1 

427718 AI798680 Hs.25933 ESTs 5.1 

434531 AA642007 Hs.116369 ESTs 5.1 

429220 AW207206 Hs.136319 ESTs 5.1 

405494 NA C2M1837*:gi|12697903|dbjjBAB21770.1|(A 5.1 

452930 AW195285 Hs.194097 ESTs. Weakly similar to 138022 hypothec 5.1 

444910 AI201849 gb:qs76gMj(1 NCLCGAPJ>i28 tomo sapiens 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.0 

444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 5.0 

450603 R43646 Hs.12422 ESTs 5.0 

416575 W02414 Hs.38383 ESTs 5.0 

438504 AW665281 Hs.224625 ESTs 5.0 

416209 AA236776 Hs.79078 MAD2 (mitotic arrest deficient, yeast h 5.0 

428804 AK000713 Hs.193736 hypothetical protein RJ20706 5.0 

420077 AW512260 Hs.87767 ESTs 4.9 

450480 X82125 Hs.25040 zinc finger protein 239 4.9 

437637 AJ003029 Hs.65792 syntrophin.gamma2 4.9 

431808 M30703 Hs.270833 amphireguiin (schwannoma-derived growth 4.9 

418836 AI655499 Hs.161712 ESTs 4.8 

442441 A1820662 Hs.129598 ESTs 4.8 

435635 AF220050 Hs.181385 uncharacterized hematopoietic stem/prog e 4.8 

400286 NA C16000922:gi|7499103lpirj{T20903hypothe 4.8 

407506 U71600 gb:Human zinc finger protein zfp31 (zf31 4.8 

420026 A1831190 Hs.166676 ESTs 4.8 

441377 BE218239 Hs.202656 ESTs 4.8 

457726 AI217477 Hs.194591 ESTs 4.8 

412785 AW997556 Hs.78521 KIAA1717 protein 4.8 

428368 BE440042 Hs.83326 matrix metalloproteinase 3 (stromeiysin 4,7 

436026 AI349764 Hs.217081 ESTs 4.7 

409110 AA191493 Hs.48778 niban protein 4.7 

400284 NA estrogen receptor 1 4.7 

410102 AW248508 Hs.279727 Homo sapiens cONA FU14035 fis. done HE 4.7 

407819 R42185 Hs,274803 ESTs 4.7 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.7 

422896 AW961489 Hs.154116 ESTs 4,7 

453616 NM_003462 Hs.33846 dyneln, axonemal, fight Intermediate pot 4.7 

427427 AF077345 Hs.177936 ESTs 4.6 

421751 AW813731 -Hs.159153 ESTs. Moderately sim3ar to S65657 alpha 4.6 

454074 R63503 Hs.28419 ESTs 4.6 

405718 C4000799*;gi}6330365|dbJ|BAA86508.1| (AB 4.6 

444649 AW207523 Hs.197628 ESTs 4.6 

429431 240313 Hs. 1 06330 Homo sapiens done IMAGE:23371, mRNAseq 4.6 

427811 M81057 Hs.180884 carboxypeptidase B1 (tissue) 4.6 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKENcDNA 2010 4,6 

430345 AK000282 Hs.239681 hypothetical protein FU20275 4.6 
454307 AW855717 gb:RC1-CT0279-081299-013-b01 CT0279 Homo 4.6 

400303 AA242758 Hs.79136 L1V-1 protein, estrogen regulated 4.6 

438180 AA808189 Hs.272151 ESTs 4.6 
451340 AW936273 gb:QV0-OT0020-090200-107-g07 DT0020 Homo 4.6 

458711 AL036877 Hs.282878 ESTs 4,6 

457430 AA514660 Hs.128443 ESTs 4.6 

416030 H15261 Hs.21948 ESTs 4.6 

447233 AW246333 Hs.17901 Homo sapiens, done IMAGE:3937015, mRNA, 4.6 

445537 AJ245671 Hs.12844 EGF-Iike-domain, multiple 6 (EGFL6) 4.5 

424590 AW966399 Hs.46821 hypothetical protein FU 20086 4.5 

432374 W68815 Hs.301885 Homo sapiens cDNA FU1 1346 fis. done PL 4.5 

423833 AW503329 gb:U!-HF-BN0-akx-e-02-(WJlJl NIH_MGC_5Q 4.5 

406747 AI925153 Hs.217493 annexinA2 45 
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412102 H56435 
431716 D89053 
411050 AW814902 
401418 NA 
436194 AK001074 
436211 AK001581 
414080 AA135257 
424115 AA335497 
415786 AW419196 
442117 AW664964 
452784 BE463857 
432731 R31178 
410534 AW905138 
405196 NA 
430217 N47863 
401793 

415747 AA381209 
423679 AB007975 
400238 NA 
425627 AF019612 
400608 

458634 AV657310 
407771 AL138272 
405906 NA 
405925 NA 
439382 BE247684 
445263 H57646 
407162 N63855 
419536 AA603305 
454359 N71277 
411558 AA102670 
450715 AI266484 
421451 AA231377 
452864 AA033714 
409757 NIVL001898 
413043 BE158766 
413499 BE144884 
444619 BE538082 
408380 AF123050 
406992 S82472 
404285 NA 
425247 NNL00594Q 
428046 AW812795 
446163 AA02688O 
421147 AW592167 
426451 AI908165 
415227 AW821113 
452176 AA024538 
452862 AW378065 
443646 AI085198 
425523 AB007948 
424687 J05070 
430009 AA894564 
434469 M634806 
451381 BE241831 
450229 R18717 
455700 BE068115 
431924 AK000850 
438885 AI886558 
401451 

431676 A1685464 
409092 AI735283 
429270 W60379 
443903 AI220547 
427122 AW057736 
410275 U85658 
432912 BE007371 



4.5 
4.5 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 



gb:yq98e09.r1 Soares fetal liver spleen 4.5 
Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 4.5 
gb:MR1-ST0206-12M00-G22-f08ST0206Horno 4.5 
C14000338*:giI7459502|plr||S74665 outer 
Hs.333435 Homo sapiens cONA FU 1021 2 fis, clone HE 
Hs.334828 hypothetical protein HJ10719; KIAA1794 
Hs.47783 B aggressive lymphoma gene 
Hs.293965 ESTs, Weakly similar to I38022 hypotheti 
Hs.257924 hypothetica! protein FU 13782 
Hs.128899 ESTs 

Hs.151258 hypothetical protein FU21062 
Hs.287820 fibronectin 1 

gb:QV0-NN1071-2804O0-207-g07 NN1071 Homo4.4 

C2000662*:gil7512792|pir|)T12482hypcthe 
Hs.336901 ribosomal protein S24 

C17001545:gij5360127jgb|AAD42882.1jAF155 

gb:EST94257 Activated T-cells I Homo sap 
Hs.131454 KIAA0506 protein 

C19000274*:gi|12741327jreqXP,008833il 
Hs.297007 membrane-bound transcription factor prot 

C100O1899:gil7508633lpirl|T25392 hypothe 
Hs.282898 ESTs 
Hs.62713 ESTs 

Target Exon 

Target Exon 
Hs.103070 ESTs 
Hs.42586 K1AA1560 protein 
Hs.142634 zinc finger protein 

gb:np12d1U1 NCLCGAP_Pr3 Homo sapiens 

gb:za36e03.s1 Soares fetal liver spleen 
Hs.70725 gamma-aminobutyric acid (GABA) A recepto 
Hs.31570 ESTs, Weakly similar to KIAA1324 protein 
Hs.50831 ESTs 

Hs.287629 hypothetical protein FU14260 
Hs.123114 cystatinSN 

gb:lL2-HT0397-071299-024-F02HT0397Homo 4.2 
gb:CM0-HT0182-O41O99-065*11 HT0182Homo4.2 
Hs.8172 ESTs, Moderately similar to A46010 X4n A n 
Hs.44532 diubiquitin 

gb:beta -pol=DNA polymerase beta {exon a 
C6001909:gi|704441|dbjjBAA18909.1| (D298 
matrix metalloproteinase 11 (MMP11; stro 
ESTs, Moderately similar to I38022 hypot 
prolactin receptor 
ESTs 

GATA-binding protein 3 (T-cell receptor 
ESTs 

Human DNA sequence from clone RP1-28H20 
ESTs 
ESTs 

KIAA0479 protein 



4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.2 
4.2 



Hs.155324 
Hs.155381 
Hs.25252 
Hs.293299 
Hs.169946 
Hs.72402 
Hs.282990 
Hs.8687 
Hs.164226 
Hs.158244 
Hs.151738 
Hs.22242 

Hs.172330 
Hs.8929 

Hs.272203 
Hs.184987 



Hs.172608 

Hs.57773 

Hs.135223 

Hs.323910 

Hs.61796 

Hs.200313 



4.2 
4.2 
4.2 
4.2 
M 
M 
4.2 
4.2 
4.2 
A2 
4.2 
4.2 
4.2 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 



ESTs 

gb:ab28c02.r1 Stratagene lung (937210) H 
hypothetical protein MGC2705 
hypothetical protein FU11362 
gb:CM1-BT036fl-061299-060^07 BT0368 Homo4.1 
Homo sapiens cDNA FU20843 fis. clone AO 4.1 
ESTs 

NMJHM496*:Homo sapiens hepatocyte nucle 
gb:tt88f04.x1 NCI CGAP_Pr28 Homo sapiens 
ESTs 
ESTs 
ESTs - 

HER2 receptor tyrosine kinase (c-erb-b2, 
transcription factor AP-2 gamma (actlvat 
ESTs 
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403585 Target Exon 4.1 

433295 AI394151 Hs.37932 ESTs 4.1 

420380 AA640891 Hs.102406 ESTs 4.1 

431118 BE264901 Hs.250502 carbonic anhydrase VIII 4.1 

416182 NfvL004354 Hs.79069 cycfinG2 4.1 

418994 AA296520 Hs.89546 selectin E (endothelial adhesion molecul 4.1 

400555 Target Exon 4.1 

410079 U94362 Hs-58589 glycogenln2 4.0 

427674 NM_003528 Hs.2178 H2B histone family, member Q 4.0 

427131 AA448460 Hs.1 12017 GE36gene 4.0 

439759 AL359Q55 Hs.67709 Homo sapiens mRNA full length Insert cDN 4.0 

429353 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 4.0 

421296 NM.002666 Hs.103253 perilipin 4.0 

418819 AA228776 Hs.191721 ESTs 4.0 

424188 AW954552 Hs.142634 zinc flnger protein 4.0 

455431 AW938484 gb:CMtK)T0057-290200-253-d06 DT0057 Homo4.0 

404142 NA Target Exon 4.0 

441143 A1027604 Hs.159650 ESTs 4.0 

444540 A1693927 Hs.265165 ESTs 4.0 

415579 AA165232 Hs.222069 ESTs 4.0 

452891 N75582 Hs.212875 ESTs, Weakly similar to DYH9 JiUMAN CILIA 4.0 

414605 BE390440 gb:601283601F1 NlH_MGC_44 Homo sapiens c 4.0 

452281 T93500 Hs.28792 Homo sapiens cDNA FU11041 fis, clone PL 4.0 

417801 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like rep 4.0 

446232 A1281848 Hs.194691 retinolc acid Induced 3 4.0 

447377 X77343 Hs.334334 transcription factor AP-2 alpha (activat 4.0 

437854 AL1 19723 gb:DKFZp761A2124_r1 761 (synonym: hamy2) 4.0 

446140 AA356170 Hs.26750 hypothetical protein FU21 908 4.0 

452240 AI591147 Hs.61232 ESTs 4.0 

459574 AI741122 Hs.101810 Homo sapiens cDNA FU14232 fis, clone NT 4.0 

458673 N99626 gb:za39d11.r1 Soares fetal liver spleen 4.0 

444858 AI199738 Hs.208275 ESTs, Weakly similar to ALUAJHU MAN 1 1 II 4.0 

452166 AI948607 Hs.264680 ESTs 4.0 

452681 AF153330 Hs.30246 solute carrierfamily 19 (thiamine trans 3.9 

450192 AA263143 Hs.24596 RAD51-mierac«ng protein 3.9 

406554 NA Target Exon 3.9 

416259 AA573006 Hs.19173 ESTs 3.9 

445813 Z42023 Hs.106576 alanine-glyoxylate aminotransferase 2-il 3.9 

451024 AA442176 gb:zw63b08.r1 Soares_tota(Jetus_Nb2HF8_ 3.9 

413930 M86153 Hs.75618 RAB1 1 A, member RAS oncogene family 3.9 

401781 Target Exon 3.9 

415298 F05086 Hs,328142 ESTs 3.9 

452564 AA026777 gb:ze93c11.r1 Soares_fetaLhearLNbHH19W 3.9 

442500 A1819068 Hs.209122 ESTs 3.9 

419759 Z21336 Hs.135411 actin related protein 3.9 

424638 A1472106 Hs.49303 Homo sapiens cDNA FU1 1663 fis, done HE 3.9 

439699 AF086534 -Hs.187561 ESTs, Moderately similar to ALU INHUMAN A 3.9 

428042 AA419529 Hs.76391 myxovirus (influenza) resistance 1, homo 3.9 

452501 AB037791 Hs.29716 hypothetical protein FU10980 3.9 

453049 BE537217 Hs.30343 ESTs 3.9 

443213 BE568414 Hs.145497 Homo sapiens cDNA: FU22097 fis. done H 3.9 

443489 A1073512 Hs.133916 ESTs 3.9 

455092 BE152428 gb:CM0-HT0323-151299-126-b04HTO323Homo3.9 

401785 NMJX)2275*:Homo sapiens keratin 15 (KRT1 3.9 

426427 M86699 Hs. 1 69840 TTK protein kinase 3.9 

446009 AJ989885 Hs.231926 ESTs 3.9 

436033 H75391 Hs.255748 ESTs 3,9 

451067 BE172186 gb:MRO-HT0559-1 1 0300-00 Wi1 1 HT0559 Homo3.8 

419348 AA236645 Hs.98274 ESTs 3.8 

444635 AI184268 Hs.339665 ESTs 3.8 

412140 AA219691 Hs.73625 RAB6 Interacting, Wnesin-tike (rabkines 3.8 

403593 NA Target Exon 3.8 

442323 AW016669 Hs.29190 ESTs 3.8 

419854 AW664873 Hs.87836 Homo sapiens PAC done RP5-1087M19 from 3.8 

433871 W02410 Hs.205555 ESTs 3.8 

445253 A1217928 Hs.144762 ESTs 3.8 

409542 AA503020 Hs.36563 hypotheticaJ protein FU2241 8 3.8 
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443162 T49951 
458194 AW383618 
422475 AL359938 
440705 AA904244 
447290 AI476732 
403426 

427821 M470158 
454288 BE222648 
443801 AW206942 
410658 AW105231 
410672 AW794600 
428579 NMJJ05756 
445495 BE622641 
447995 AI742618 
401747 

420633 NWL014581 
423545 AP000692 
■ 433138 AB029496 
434715 BE005346 
428664 AK001666 
450951 AA018534 
402696 NA 
446868 AV660737 
458154 AW816379 
422026 U80736 
419440 AB020689 
421524 AA312082 
417283 N62840 
401508 NA 
410303 AA324597 
420362 U79734 
433384 AI021992 
434302 AA623065 
443938 R55373 
446420 BE623004 
458712 A1347502 
433404 T32982 
405232 

430491 AL109791 
455609 BE011668 
450164 AI239923 
453948 A1970797 
436061 A1248584 
401049 NA 
418867 D31771 
420179 N74530 
458663 AV658444 
437259 AI377755 
428309 M97815 
450522 AI69B839 
451952 AL120173 
412209 AW901456 
425201 AA352111 
443830 AI142095 
439255 BE164500 
414869 AA157291 
409064 AA062954 
407721 Y12735 
445135 AK000054 
404091 NA 
409731 AA125985 
405153 

423248 AA380177 
403639 NA 
404360 

422352 AA766296 
423338 AB007961 



Hs.9029 DKFZP434G032 protein 

Hs.265459 ESTs, Moderately similar (o ALU2_HUMAN A 

Hs.117313 Meis{mouse)homolog3 

Hs.153205 ESTs 

Hs.263912 ESTs 

Target Exon 
Hs.98202 ESTs 

Hs.279458 ESTs, Highly similar to c38QA1.1 b (H^ap 
Hs.253594 ESTs 
Hs.192035 ESTs 



3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 



gbJlC6^M0014-17030(W)22-C05 UM0014 Homo3.8 



Hs.184942 G protein-coupled receptor 64 
Hs.38489 ESTs, Weakly similar to 138022 hypotheti 
Hs.181 733 ESTs, Weakly similar to nitrflase homolo 

Homo sapiens keratin 17 (KRT1 7) 
Hs.274480 odorant-binding protein 2A 
Hs.1 29781 chromosome 21 open reading frame 5 
Hs.59729 semaphorinsem2 
Hs.1 16410 ESTs 

Hs.189095 similar to SALL1 (sal (Drosoph3a)-like 
Hs.103334 ESTs 



3.8 
3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 



C3002523gll6686211lspJQ27533|YH2M_CAEa3.7 



3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 



Hs.135100 ESTs 
Hs.335018 ESTs 

Hs.1 10826 trinucleotide repeat containing 9 
Hs.90419 KIAA0882 protein 
Hs.105445 GDNF family receptor alpha 1 
Hs.48648 ESTs 

NMJ)24817:Homo sapiens hypothetical prot 
Hs.21851 Homo sapiens cONA RJ12900 fis, clone NT 
Hs.97206 huntkigtin interacting protein 1 
Hs.124244 ESTs 
Hs.1 10301 ESTs 

Hs.20864 ESTs 3.7 
gbf 01441262F1 UMJAGCJ2 Homo sapiens c 3.7 

Hs.107872 hypothetical protein FU20761 3,7 

Hs.102720 ESTs 3.7 

NH.01 5832:Homo sapiens methyl-CpG bindln 3.7 

Hs.241559 Homo sapiens mRNA full length insert cDN 3.7 
gb:CM3-BN0223-100500-177-a04 BN0223 Homo3.7 

Hs.30098 ESTs 3.7 

Hs.64859 ESTs 3.7 

Hs.190745 Homo sapiens cONA: FLJ21326 fis, clone C 3.7 

Target Exon 3.6 

Hs.89404 msh (Drosophila) homeo box homolog 2 3.6 

Hs.21168 ESTs 3.6 

Hs280776 tankyrase, TRFI-lnterading ankyrin-rela 3.6 

Hs.120695 ESTs 3.6 

Hs.183650 cellular retinoic add-binding protein 2 3.6 
gbwd31ID2.x1 SoaresJiFkX_G8C_S1 Homos3.6 

Hs.301663 ESTs 3.6 
gb:RCO-NN1012-270300-031-c07 NN1012 Homo3.6 

gbf ST60061 Activated T-cells XX Homo sa 3.6 

Hs.143273 ESTs 3.6 
gb^C4-HT0469-23030(W)14-«10 HT0469 Homo 3.6 



Hs.21479 ubinudehl 
Hs.141883 ESTs 

Hs.38018 dual-spedficirytyrosine-(YH)hosphoryl 
Hs.12347 hypothetical protein FU20047 
Target Exon 

Hs.56145 thymosin, beta, Identified in neuroblast 
Target Exon 

Hs.1 25845 ribulose-S-phosphate-3-epimerase 

ENSP00000233023*:CDNA FLJ12662 fis, clon 
C7001385:gi|12082809|gb|AAG48618.1lAF315 3.6 

Hs.99200 ESTs 3.6 

Hs.127338 KIAA0492 protein 3.6 
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424202 BE350295 Hs.15032 RAN binding protein 17 3.6 

431750 AA514986 Hs.283705 ESTs 3.6 

439907 AA853978 Hs.124577 ESTs 3.6 

453596 AA441838 Hs.62905 hypothetical protein FU 14834 3.6 

406446 NA Target Exon 3.6 

418454 AA315308 Hs.195870 hypothetical protein FU14991 3.6 

434360 AW015415 Hs.127780 ESTs 3.6 

409079 W87707 Hs.Q2QB5 interieukin 6 signal transducer (gp130, 3,6 

440132 AI697121 Hs.202466 ESTs, Weakly similar to S65824 reverse t 3.6 

448706 AW291095 Hs.21814 interteukin 20 receptor, alpha 3.6 

440671 AW297920 Hs.130054 ESTs 3.5 

407647 AW860158 gb^CO-CT0379-29010(W32-b04CT0379 Homo 3.5 

459023 AW968226 Hs.60798 ESTs 3,5 

402820 NM_017646*:Homo sapiens tRNA isopentenyl 3.5 

417009 AA191719 Hs.314714 ESTs 3.5 

447048 AW393080 Hs^28320 hypothetical protein FU2353 7 3.5 

449978 AI806335 Hs.200829 ESTs, Weakly similar to T30171 mm - 3.5 

428062 AA420683 Hs.98321 hypothetical protein FU14103 3.5 

452909 NM_015368 Hs.30985 pannexln 1 3.5 

400610 NA Target Exon 3.5 

417843 W07361 Hs.22545 Homo sapiens cDNA FU 12935 lis, clone NT 3.5 

419335 AW960146 Hs.284137 hypothetical protein RJ 12888 3.5 

451592 AI805416 Hs.213897 ESTs 3.5 

443270 NMJHM272 Hs.337737 Homer, neuronal immediate early gene, 1B 3.5 

423S48 AW392342 Hs.283077 centrosomal P4.1-associated protein; unc 3.5 

449424 AW448937 Hs.197030 ESTs 3.5 

423841 AW753967 gb:RC2^T0304-08010(W11-h12CT0304 Homo 3.5 

416806 NM_000288 Hs.79993 peroxisomal biogenesis factor 7 3.5 

422060 R20893 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 3.5 

433104 AL043002 Hs.128246 ESTs, Moderately similar to unnamed prot 3.5 

415778 H84847 Hs.49391 hypothetical protein LOC541 49 3.5 

413054 AW316843 Hs.66309 hypothetical protein MGC 11 061 3.5 

416636 N32536 Hs.42645 soluta carrier family 16 (monocarboxylic 3.5 

424639 AI917494 Hs.9812 Homo sapiens cDNA FU 14388 fis, clone HE 3.5 

424827 AI057094 Hs.96867 Homo sapiens cDNA: FU23155 fis, clone L 3.5 

437782 AI370876 Hs.79090 exportin 1 (CRM1, yeast, homolog) 3.5 

41 1514 AW850178 gb:(L3-CT021 9-271 099422-H 12 CT0219 Homo 3.5 

413783 AA314337 Hs.301547 ribosomal protein S7 3.5 

421106 AA877124 Hs.172844 ESTs 3.5 

431291 N25521 Hs.25275 Kruppel-type zinc finger protein 3.5 

440623 AI935016 Hs.216639 ESTs 3.5 

455838 BE145808 gb:MRO-HT0208-101299-103-f1 1 HT0208 Homo 3.5 

458771 AW295151 Hs.163612 ESTs 3.5 

442942 AW167087 Hs.131562 ESTs 3.5 

436550 Z50158 Hs.270235 ESTs, Weakly similar to MMHUB1 laminln b 3.5 

418849 AW474547 Hs.53565 Homo sapiens PIG-M mRNA for mannosyftran 3.5 

424420 BE614743 Hs.146688 prostaglandin E synthase - 3.5 

430916 AW505021 Hs.88414 BTB and CNC homology 1 , basic leucine zi 3.5 

432030 AI90840O Hs.143789 ESTs 3.5 

439405 AF086224 Hs.55238 ESTs 3.5 

405917 NA C17000675:gij7290703lgb)AAF46150.1|{AE0 3.5 

452727 AW993582 Hs.176220 ESTs 3.5 

426320 W47595 Hs.169300 transforming growth factor, beta 2 3.4 

421070 AA283185 Hs.19327 ESTs 3.4 

424625 AW904466 Hs.321197 PDZ domain protein (Drosophila inaWike 3.4 

428508 BE252383 Hs.184668 SBB131 protein 3.4 

455651 BE064962 gb:RC1-BT031 3-130400416*02 BT0313 Homo 3.4 

410555 U92649 Hs.64311 a disintegrin and metalloproteinase doma 3.4 

447754 AW073310 Hs.163533 Homo sapiens cDNA FU14142 fis, done MA 3.4 

418636 AW749855 gb:QV4-BTQ534-28 1299-053*05 BT0534 Homo 3.4 

404097 NA C5000242*:gll9369379|gblAAF87128.1|AC006 3.4 

434205 AF119861 Hs.283032 hypothetical protein PRO201 5 3.4 

421072 AI215069 Hs.89113 ESTs 3.4 

402421 NA C1001578*:gii6759903JgbJAAF28099.1|(AF1 3.4 

405248 NA Target Exon 3.4 

407638 AJ404672 Hs.334483 hypothetical protein FU23571 3.4 

403000 BE247275 Hs.151787 U5 snRNP-specific protein, 116 fcO 3.4 
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433393 

432239 

458747 

442082 

417974 

446002 

448995 

436007 

424698 

435202 

410467 

405460 

441826 

453472 

447078 

441690 

420092 

418478 

408908 

414737 

449650 

418912 

436405 

453911 

409361 

429548 

420807 

409695 

445189 

402892 

426681 

458722 

409430 

443194 

445432 

410908 

406151 

436461 

411171 

432415 

439310 

401575 

420900 

445628 

448243 

445102 

442118 

416783 

435039 

451474 

442559 

453921 

420036 

435627 

411598 

446733 

410153 

403637 

405547 

427878 

451871 

410313 

416856 

449490 

450506 

440684 

459055 



AF038564 
X81334 
BE618395 
R41823 
AA210765 
AI346468 
A1613276 
AI247716 
AA164366 
AI971313 
AF102546 
NA 

AW503603 
AL037925 
AW885727 
R81733 
AA814043 
U38945 
BE296227 
AI160386 
AF055575 
NMJXJ0685 
AA160079 
AW503857 
NMJJ05982 
AW138872 
AA280627 
AA296961 
A1936450 
NA 

AA994896 
AA741545 
R21945 



AV653771 
AA121686 
NA 

AW511956 

AW820260 

T16971 

AF086120 

NA 

AU045633 
AI344166 
AW369771 
AW204610 
AA976718 
AA206186 
AW043921 
T70874 
T10213 
AI824009 



Hs,98074 itchy (mouse home-log) E3 ubiquitin prate 3.4 

Hs.2936 matrix metafloproteinase 13 (collagenase 3.4 

Hs.257391 hypothetical protein DKFZp761J1523 3.4 

HsJ413 ESTs; caisyntenin-2 3.4 
gb:zr90c06.r1 NCLCGAPJ3CB1 Homo sapiens 3.4 

Hs.145789 ESTs 3.4 

Hs.5662 guanine nucleotide binding protein (G pr 3.4 

Hs.232168 ESTs 3.4 

Hs.151973 hypothetical protein FU23511 3.4 

Hs.170204 KiAA0551 protein 3.4 

Hs.63931 dachshund (Drosophila) homolog 3.3 

Target Exon 3.3 

Hs.129915 phosphotriesterase related 3.3 

gb:DKFZp564M037_f1 564 (synonym: hfbr2) 3.3 

Hs.301570 ESTs 3.3 

Hs.33106 ESTs 3.3 

Hs.88045 ESTs 3.3 

Hs.1 174 cyclin-dependent kinase inhibitor 2A (me 3,3 

Hs.250822 serine/threonine kinase 15 3.3 

Hs.125087 ESTs 3.3 

Hs.23838 calcium channel, voltage-dependent, Lty 3.3 

Hs.89472 angiotensin receptor 1 3.3 

Hs.172932 Homo sapiens mRNA for partial 3'UTR, seq 3.3 

Hs.4007 Sarcolemmal-associated protein 3.3 

Hs.54416 sine oculis homeobox (Drosophila) homolo 3.3 

Hs.135288 ESTs 3.3 

Hs.57846 ESTs 3.3 

gb:EST112514 Adrenal gland tumor Homo sa 3.3 

Hs.147482 ESTs 3.3 

Target Exon 3.3 

Hs.22514 ESTs 3.3 

Hs.282832 ESTs, Weakly similar to T24961 hypotheU 3.3 

Hs.1 66975 splidng factor, arginine/serine-rich 5 3.3 

Hs.279009 matrix Gla protein 3.3 
gb:AV653771 GLC Homo sapiens cDNA clone 3.3 

Hs.10592 ESTs 3.3 

Target Exon 3.3 

Hs.293261 ESTs 3.3 
gb:QV2-ST0296-150200-040-c10 ST0296 Homo 3.3 

Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3.3 

Hs.102793 ESTs 3.3 

Target Exon 3.3 

Hs.44269 ESTs 3.3 

Hs.155743 ESTs 3.3 

Hs.52620 integrin, beta 8 3.3 

Hs.22270 ESTs 3.3 

Hs.202242 ESTs 3.3 

Hs.79889 monocyte to macrophage differentiation* 3.3 

Hs.130526 ESTs 3.3 

Hs.207636 ESTs 3.2 

Hs.159993 gycosyltransferase 3.2 

Hs.44577 ESTs 3.2 



R60336 Hs.52792 Homo sapiens mRNA; cONA DKFZp586l1823 (f 32 

W88774 Hs.118370 ESTs 3.2 

BE336654 Hs.70937 H3 histone family, member A 3.2 

AA863360 Hs.26040 ESTs, Weakly similar to fatty acid omega 3.2 

BE311926 Hs.15830 hypothetical protein FLJ12691 3.2 

NA C3001106*:gi|10047201|dbjlBAB13394.1| (A 3.2 

NMJ)18B33«:Homo sapiens transporter 2, A 3.2 

C05766 Hs.181022 CGI-07 protein 3.2 

AI821005 Hs,1 18599 ESTs 3.2 

R10305 Hs.185683 ESTs 3.2 

N27833 Hs.269028 ESTs, Weakly similar to 138022 hypotheti 3.2 

AI652777 Hs.197069 ESTs 3.2 

NM 004460 Hs.418 fibroblast activation protein, alpha 3.2 

AI253123 Hs.127356 ESTs. Highly similar to S21 424 nesting 3.2 

N23235 Hs.30567 ESTs, Weakly similar to B34087 hypotheti 3.2 
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457421 
453403 
454141 
65 426650 
450865 
407993 



452190 
430965 
405394 
424693 
454265 
437687 
428372 
414083 
411670 
416283 
437488 
428398 
452042 
421477 
438078 
448816 
419519 
404580 
447046 
457473 
429838 
459702 
400195 
417860 
417995 
422589 
435870 
440801 
426274 
423728 
439677 
452834 
431349 
417576 
430264 
418827 
410835 
426269 
405336 
437783 
440931 
455945 
430437 
405848 
455685 
406970 
409602 
423518 
425653 
426326 
433805 
437152 



452844 
407366 
408254 
424085 
416790 
420020 
426119 



H26735 
AA489732 

BE169810 

H03556 

M765917 

AK000684 

AL121282 

AW856552 

NMJJ05429 

AA758239 

AI249368 

H38857 

AI904743 

AI016377 

AB033052 

AI198719 

AA326187 

AW974903 

AW904907 

AI204995 

NA 

AW408557 

AW974175 

AA312735 

AA701327 

AA906366 

D38122 

AW891294 

R82331 

AJ638627 

AA503653 

AA339449 

AA470519 

BE327311 

AW806906 

H15302 

NA 

AI683150 
AI583052 
BE160636 
AI768801 
NA 

BE066976 

M29994 

W26713 

045027 

A/065104 

BE165753 

AA706910 

AL050027 

AI541305 

AW407181 

AF026942 

AW807227 

NWL002914 

R83066 

BE295866 

W94997 

U07616 

AL1 17431 

BE466639 

AW138413 

AA382814 

AI248013 

AW135274 



Hs.91668 
Hs.154918 

Hs.47557 
Hs.300949 
Hs.122840 
Hs.183887 
Hs.257786 

Hs.79141 
Hs.180330 
Hs.98558 
Hs.243901 
Hs.104650 
Hs.131693 
Hs.22151 
Hs.176376 

Hs.17170 
Hs.291231 
Hs.30732 



Homo sapiens clone PP1498 unknown mRNA 3.2 

ESTs 3.2 

Target Exon 3.2 

ESTs 3.2 

ESTs, Weakly similar to thyroid hormone 3.2 

ESTs 3.2 

hypothetical protein RJ22104 3.2 

ESTs 3,2 
gb:RC1-CT029448010lH)12-a04 CT0294 Homo 3.2 



Hs.235498 

Hs.188751 

Hs.30512 

Hs.17949 

Hs.190535 

Hs.2007 

Hs.132136 

Hs.164599 

Hs.105685 

Hs.156942 

Hs.82285 

Hs.47166 

Hs.168950 

Hs.201550 
Hs.270058 

Hs.169943 



Hs.256972 
Hs.129732 
HsJ>49718 
Hs.250528 
HS.112742 

Hs.48778 
Hs.218377 



Hs.139226 

Hs.7043 

Hs.94382 

Hs.189917 

Hs.173034 

Hs.112165 

Hs.61779 

Hs.139336 

Hs.106532 
Hs.12433 



vascular endothelial growth factor C 3.2 

ESTs 3.2 

ESTs 3.2 

Homo sapiens cDNA FU20738 fis, clone HE 3.2 

hypothetical protein RJ10292 3.2 

ESTs 3.2 

K1AA1226 protein 3.2 

ESTs 3.2 

N014112*itorra)sapienstricnorhinophala 32 

G protein-coupled receptor 4 3.2 

ESTs 3.1 

hypothetical protein FU13409. KJAA1711 3.1 

g b:an03c03 jc 1 Stratagene schizo brain S1 3. 1 

NNL007057*:Homo sapiens ZW10 interactor 3.1 

hypothetical protein FU14075 3.1 
ESTs, Weakly similar to MAPB_HUMAN MICRO 3.1 

Homo sapiens mRNA for K1AA0556 protein, 3.1 

ESTs 3.1 

ESTs 3.1 

tumor necrosis factor (figand) superfam! 3.1 

solute carrier family 4, sodium bicarbon 3.1 

ESTs 3.1 

KIAA1688 protein 3.1 

ESTs, Moderately similar to ALU^UMAN A 3.1 

phosphoribosylglyclnamide formyltransfer 3.1 

gb:nc71f10.s1 NCI_CGAP_Pr1 Homo sapiens 3.1 

HT021 3.1 
gb:QV4-ST0023-160400-172-d12 ST0023 Homo 3.1 
Homo sapiens mRNA; cDNA DKF2p566A1046 (f 3.1 

Target Exon 3.1 

ESTs, Weakly similar to ALU1J1UMAN ALU S 3.1 

ESTs 3.1 
gb:PM1-HT0422-291299-O02-c08 HT0422 Homo3.1 

Homo sapiens cDNA RJ 13569 fis, clone PL 3.1 

Target Exon 3.1 
gb:PM0-BTO340-211299403-c12 BTO340 Homo 3.1 

gb:Human alpha-l spectrin gene, exon 12. 3.1 

ESTs . - 3.1 

R3H domain (binds single-stranded nuclei 3.1 

ESTs, Weakly similar to A46010 X-finked 3.1 

Homo sapiens, clone IMAGE:4098694, mRNA, 3.1 

ESTs 3.1 
gfcHomo sapiens mRNA; cDNA DKFZp566C03243.1 

niban protein 3.1 

Homo sapiens cDNA FU11927 fis. clone HE 3.1 

gb:Homo sapiens cig33 mRNA, partial sequ 3.1 
gb:MR4-ST(X)62-18020Q>O0t-e1O ST0062 Homo 3.1 

replication factor C (activator 1)2 (40 3.1 

succinate-CoA figase, GOP-formlng, alpha 3.1 

adenosine kinase 3.1 

ESTs 3.1 

amphlphysin (Stiff-Mann syndrome with br 3.1 

Homo sapiens cONA RJ12198 fis, clone MA 3.1 

Homo sapiens cONA FU13591 fis, done PL 3.1 

ATP-bindtng cassette, sub-family C (CFTR 3.1 

gb:EST96097 Testis I Homo sapiens cONA 5 3.1 

ESTs, Weakly similar to 138588 reverse t 3.1 

ESTs 3.1 
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446466 
457888 
420058 
409248 
452747 
418926 
419346 
429826 
435147 
420139 
405609 
404274 
449777 
415459 
415245 
406291 
414210 
432055 
442246 
451353 
451177 
418026 
401326 
409920 
432887 
411789 
401045 
432162 
434627 
442611 
425477 
457183 
433014 
415542 
416173 
408155 
453913 
435495 
423629 
411836 
415030 
419606 
440310 
443608 
414140 
444781 
435393 
454071 
446922 
448062 
427711 
450382 
424866 
433043 
420802 
445625 
403677 
411093 
435255 
443127 
448104 
427315 
430414 
423600 
458562 
402109 
429629 



H38026 
BE219794 
AK001423 
AB033035 
BE153855 
AA232658 
AI830417 
N93266 
AL133731 
NM 005357 
NA 

AI971362 
H07118 
N59650 
NA 

BE383592 
AW972359 
A1791988 
N21043 



Hs.308 

Hs.293471 

Hs.94694 

Hs.51965 

Hs.61460 

Hs.105794 

Hs.44143 

Hs.40747 

Hs.4774 

Hs.95351 



Hs.231945 

Hs.6099 

Hs.27252 



arrestin 3, retina) (X-anrestin) 
ESTs 

Homo sapiens cDNA FLJ10561 fis, clone NT 
K1AA1209 protein 
tg superfamily receptor LNIR 



BE379727 
NA 

BE169746 
A1926047 
AF245505 

AA584062 
AI221894 
8E077155 
AW958879 
H91882 
NWL014711 
R13474 
R52782 
AB014528 
AW0046B3 
AI754212 
AW021173 
AW901879 
031118 
AW294795 
AA878939 
A1375957 
AA281279 
NWL014400 
AA701259 
AI041793 
BE175605 
AW295923 
M31659 
AA397658 
W01938 
W57554 
U22376 
BE246743 
NA 

BE067650 
W87434 
BE568102 
AI674818 
AA179949 
AW365665 
AI633559 
N34128 
NA 

BE501732 



polybromo 1 
ESTs 



3.1 
3.1 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 



CO la " 

Homo sapiens mRNA; cDNA DKFZp761C17l2 (f 3.0 



3.0 

3.0 
3.0 
3.0 
3.0 
3.0 
3.0 



Hs.293334 
Hs.129115 
Hs.42932 
Hs.13034 
Hs.83213 

Hs.12504 
Hs.162859 
Hs.72157 

Hs.272798 
Hs.39311 
Hs.177537 
Hs.270535 
Hs,1 18569 
Hs.279912 
Hs.290263 

Hs.43133 
Hs.78934 
Hs.21951 
Hs.18612 
Hs.314453 
Hs.191735 
Hs.198529 
Hs.125406 
Hs.289074 
Hs.23317 
Hs.11950 
Hs.189299 
Hs.42502 



lipase, hormone-sensitive 
ENSP000Q0241065 t :CDNA 
NM 002944**>tomo sapiens v-ros avian UR2 
ESTs 
ESTs 
ESTs 

Target Exon 

gb:601297871F1 NIHJAGCJ9 Homo sapiens c 3.0 
ESTs 3.0 
ESTs 3.0 
ESTs 3.0 
ESTs 3.0 
fatty acid binding protein 4, adipocyte 3.0 
C10000447*:gi|1168375|spp43467|AGALPEO 3.0 
likely ortholog of mouse Arkadia J n 

ESTs 

DKFZP564I1922 protein 
C11001883*:gi|6753278|refiNPJ33938.1jc 
hypothetical protein RJ20413 
ESTs 

hypothetical protein DKFZp761B1514 
ESTs 

Dvt-binding protein IDAX (inhibition of 
KIAA04i9 gene product 
ESTs, Weakly similar to 138022 hypotheti 
gb:yg99d09.r1 Soares infant brain 1NIB H 
K1AA0628 gene product 
mutS (E. coB) homoiog 2 (colon cancer, 
Homo sapiens Xq pseudoautosomai region; 
Homo sapiens cONA: FLJ21909 fis, done H 
ESTs 

hypothetical protein MGC10520 
ESTs 
ESTs 

F-box only protein 22 
hypothetical protein FU14681 
GPI-anchored metastasis-associated prate 
ESTs 
ESTs 



3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 



3.0 
3.0 
3.0 
2.9 
2.9 
2.9 
2,9 
2.9 



gb:RC5-HT0580-100500-022-H07 HT0580 Homo3.0 
Hs.255472 KIAA1843 protein 
Hs. 1 80408 solute carrier family 25 (mitochondrial 
Hs.60257 Homo sapiens cDNA FU13598 fis. done PL 
Hs.337243 ESTs, Weakly similar to ALU7 JHUMAN ALU S 
Hs.125019 lymphoid nudear protein (LAF-4) mRNA 
Hs.1 334 v-myb avian myeloblastosis viral oncogen 
Hs.288529 hypothetical protein FU22635 

C4001462:gi|4887715lgb[AAA79329.2| (L088 
gb-MR4-BT0358-090300-003-e01 BT0358 Homo 19 
ESTs, Moderately similar to ALU1.HUMAN A 2.9 
mitochondrial ribosomal protein S16 2.9 
Homo sapiens cDNA FLJ11375 fis, done HE 2.9 
Homo sapiens mRNA; cDNA DKFZp564N0763 (f 2.9 
ESTs 2.9 
ESTs M 

ESTs 11 
Target Exon • 
Homo sapiens cDNA FLJ13010 fis, done NT 2.9 



Hs.106015 
Hs.180312 
Hs.316433 
Hs.175563 
Hs.120388 
Hs.310359 
Hs.145268 

Hs.30622 
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442295 
419752 
404721 
445107 
401987 
430566 
444517 
445563 
427691 
456561 
401458 
421039 
459504 
424962 
409617 
416931 
413221 
409732 
433687 
434340 
454529 
421379 
452291 
457402 
449051 
408761 
401093 
435061 
447985 
449340 
426384 
411905 
405953 
420854 
434265 
428365 
442861 
448337 
452554 
412248 
449450 
434757 
409038 
454545 
439842 
428479 
424800 
411086 
400250 
449168 
456482 
426044 
431854 
405873 
440400 
458265 
413708 
423739 
424408 
453096 
421825 
417742 
402765 
444378 
419172 
401497 
402376 



AI827248 
AA249573 



AJ208121 Hs.147313 



AA481282 
AI939339 
AW873606 
AW194426 



NMJJ03478 

BE514127 

NNI012288 

BE003760 

045371 

BE161151 

NIA.016122 

AA743991 

AI193043 

Z45439 

Y15221 

AF015592 

AW452648 

AW961400 

AA057264 

AI651474 

AI681475 

AW235786 

AI472078 

BE265067 

NA 

AW296927 

AA846811 

AA295331 

AA243837 

AW206453 

AW452434 

BE176480 

AL039852 

A1038997 

T97490 

AW806899 

AI910896 

Y00272 

AL035588 

BE070800 

NA 

NNL016206 

AA485224 

AA502490 

AA383550 

NA 

AA394364 

AI075375 

BE158791 

AA398155 

AI754813 

AW294631 

AA298758 

R64719 

R41339 
AW338625 



Hs.224398 Homo sapiens cDNA FU11469 fis, done HE 2.9 
Hs.152618 ESTs. Moderately similar to ZN91_HUMAN Z 2.9 
NMJ)Q5596*:Homo sapiens nudear factor I 2.9 
ESTs. WeakJy similar to 138022 hypotheti 2,9 
NMJ)02737*:Homo sapiens protein kinase C 2.9 
Hs.190149 ESTs 2.9 
Hs.146883 ESTs 2.9 
Hs.149006 ESTs 2.9 
Hs.20726 ESTs 2.9 
Hs.246358 ESTs. Weakly similar to T32250 hypotheti 2.9 
Target Exon 2.9 
Hs,101299 cullin 5 2.9 
gb:601315974F1 NIH MGC 8 Homo sapiens cO 2.9 
Hs.153954 TRAM-fike protein 2.9 
Hs.55209 Homo sapiens mRNA; cONA OKFZp434K0514 (f 2.9 
Hs.80485 adipose most abundant gene transcript 1 2.9 
gb:PM0-HT0425-14l299-001-FO8 HT0425 HomoZ9 
Hs.56148 NY-REN-58 antigen 2.9 
gb:ny57g01.s1NCI_CGAP_Pr18Homosapiens 2.9 
Hs.128685 ESTs, Weakly similar to T1 7226 hypotheti 2.9 
Hs.270425 ESTs 2.9 
Hs.103982 small indudble cytokine subfamily B (Cy 2.9 
Hs.28853 CDC7 (cell division cyde 7, S. cerevisl 2.9 
Hs.149342 activation-induced cytidine deaminase 2.9 
Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 2.9 
Hs.238936 ESTs, Weakly similar to (defEne not ava 2.9 
C12000586*:gi|6330167ldbj]BAA86477.1| (A 2.9 
Hs.163944 ESTs 2.9 
Hs.200949 ESTs 2.9 
Hs.195359 hypothetical protein MGC10954 2.9 
Hs.303662 ESTs 2.9 
gb:601 193893F1 N!H JAGC_7 Homo sapiens cD 2.9 
Target Exon 2.8 
gb:UI-H-BW0-ajc-c47-0-Ul.s1 NCLCGAP_Su 2.8 
Hs. 130554 Homo sapiens cDNA: FU23089 fis, done L 2.8 
Hs.183861 Homo sapiens cDNA FLJ20042 fis, done CO 2.8 
Hs.57787 ESTs 2.8 
Hs.3782 ESTs 2.8 
Hs.58006 ESTs. Weakly similar to ALU5.HUMAN ALUS 2.8 
gb:RC3-HT0585-160300-022-cO2 HT0585 Homo 2.8 
Hs.49136 ESTs, Moderately similar to ALU7JWMAN A 2.8 
Hs.132921 ESTs 2.8 
Hs.50002 small Inducible cytokine subfamily A (Cy 2.8 
gb:QV4-ST0023-160400-172-c12 ST0023 Homo 2.8 
Hs.132413 ESTs 2.8 
Hs. 184572 celi division cycle 2, G1 to S and G2 to 2.8 
Hs.153203 MyoO family inhibitor 2.8 
gb:RCJ-BT0502-251199-Q11-c07 BT0502 Homo 2.8 
Eos Control 2.8 
colon carcinoma related protein 2.8 
gbaa41b12.s1 NCLCGAPJ3CB1 Homo sapiens2.8 
ESTs 2.8 
polymerase (DNA directed) iota 2.8 
Target Exon 2.8 
ESTs. Weakly similar to T25472 hypotheti 2.8 
ESTs, Weakly similar to IRX2JHUMAN IROQU 18 
gb:IU-HT0397-091299-025J)Q2HT0397Homo 2,8 
ESTs 2.8 
collagen, type V, alpha 1 2.8 
ESTs 2.8 
ESTs. Moderately similar to CALB_HUMAN C 2.8 
gb:EST22d1 1 WATM1 Homo sapiens cDNA don 2.8 
C1003621*:gi|12407405|gblAAG53491.1JAF22 2.8 
ESTs 2.8 
ESTs 2.8 
Target Exon 2.8 
C19000763 4 :gi|1363912|pir{jJC4296 ring f 2.8 
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Hs.23142 

Hs.336695 
Hs.271699 

Hs.125594 
Hs.128193 

Hs.97600 
Hs.146428 
Hs.11325 
Hs.183747 



Hs.12569 
Hs.22120 
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405041 NA 
408758 NM.003686 
431917 D16181 
437583 AA761190 
453737 AA744862 
458094 AF086325 
401283 NA 
410784 AW803201 
417601 NMJJ14735 
418236 AW994005 
435532 AW291488 
454714 AW815098 
418629 BE247550 
442101 AI651930 
405080 AK000375 
414661 T97401 
425589 A1650633 
429638 AI916662 
428824 W23624 
421566 NM.000399 
414596 8E386870 
440B68 R79707 
452943 BE247449 
443772 AV646449 
432361 AI378562 
430375 AW371048 
405504 NA 
423279 AW959861 
424871 NNL004525 
453619 H87648 
423961 D13666 
422156 N34524 
415752 BE314524 
419987 NM_005014 
406182 NA 
416495 X69970 
444701 A1916512 
408171 AA301228 
430153 AW968128 
413383 AA128978 
414831 M31158 
413278 BE563085 
433132 AB026264 
437030 AA742577 
439031 AF075079 
449532 W74653 
406153 

406625 Y13647 
444698 A1188139 
432328 AI572739 
429628 H09604 
420149 AA255920 
431207 AA495925 
438394 8E379623 
443304 AI050073 
427660 AI741320 
408460 AA054726 
416515 N91716 
429922 Z97630 
418203 X54942 
439509 AF086332 
402184 NA 
450496 AW449251 
451963 AI825440 
457938 AI373638 
441541 AA938663 
441111 AI806867 



C3001706*:gil1345652|sp|P15989|CA36^CHlC 2.8 



2.8 
2.8 
2.8 
18 
2.8 
2.8 



Hs.47504 exonuctease 1 
Hs.2868 peripheral myelin protein 2 
Hs.244627 ESTs 

Hs.194293 ESTs, Weakly similar to I54374 gene NF2 
gbJHomo sapiens full length insert cDNA 
Target Exon 

gb:IL2-UMW77-07050Q-080-E06 UMQ077 Homo 2.8 
KIAA0215 gene product 2.8 
ESTs 2.8 
Homo sapiens, clone !MAGE:3682908, mRNA 2.8 
gb:QV4-ST0212-091199-023-f10 ST0212 Homo 2.8 
growth factor receptor-bound protein 7 2 3 

ESTs 

HDCMC28P protein 
ESTs 

Homo sapiens cDNA: FU23031 fls, clone L 
kinectin 1 (kinesrn receptor) 
ESTs 

early growth response 2 (Krox-20 (Drosop 
gb:601275271F1 NIHJAGCJ0 Homo sapiens c 2.7 



Hs.82292 
Hs.337534 
Hs.117305 

Hs.86859 
Hs.135684 
Hs.88820 
Hs.21929 
Hs.143688 
Hs.211577 
Hs.173059 
Hs.1395 

Hs.263339 
Hs.31082 
Hs.282872 
Hs.159585 
Hs.93758 

Hs.290943 
Hs.153595 
Hs.33922 
Hs.136348 

Hs.78776 
Hs.94070 

Hs.79350 
Hs.198394 
Hs.43299 
Hs.336679 
Hs.154706 
Hs.77439 
Hs.B33 
Hs.284245 
Hs.303781 

Hs.271593 

Hs.1 19597 

Hs.147050 

Hs.195471 

Hs.13268 

Hs.88095 

Hs.9394 

Hs.27693 

Hs.135338 

Hs.1 14121 

Hs.285574 

Hs.194140 

Hs.226117 

Hs.83758 

Hs.58314 

Hs.257131 
Hs.224952 
Hs.133900 
Hs.199828 
Hs.126594 



2.8 
2.8 
2.8 
2.8 
2.7 
2.7 
2.7 



2.7 
Z7 
2.7 
2.7 
2.7 
27 
2.7 
2.7 



ESTs, Moderately similar to I38022 hypot 
hypothetical protein FU10525 
ESTs 
ESTs 

H4 histone family, member H 
C5000558:gil4504675lrefiNP_002175.1| int 
ESTs 

taw density fipoprotelrwelated protein 
Homo sapiens, done MGC:9084, mRNA, comp 2.7 
osteoblast specific factor 2 (fasctelin 2.7 
gb:yy56d10.s1 Soares_rrmlfiple_sderosis_ 
putative transmembrane proiein 
osteomodulin 
Target Exon 

RYK receptor-like tyrosine kinase 
ESTs • 

hypothetical protein FU12890 
ESTs 

hypothetical protein FU14917 
protein kinase, cAMP-dependent regulato 
interferon-stimulated protein, 15 kDa 
hypothetical protein IMPACT 
EST 

gb:Homo sapiens full length insert cDNA 
ESTs, Moderately similar to A47582 B-cel 
Target Exon 



ESTs 
6 

ESTs 
ESTs 
ESTs 

peptidylprolyl isomerase (cydophitin)-! 
ESTs 

Homo sapiens cDNA: FU23228 lis, clone C 
ESTs 

ESTs, Weakly similar to 138022 hypotheti 
H1 histone family, member 0 
CDC28 protein kinase 2 
ESTs 

ENSP00000245238*:CDNA FU10922 fis, don 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 
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2.7 
2.7 
2.7 
2.7 
2.7 
2.7 
2.7 
2.7 
27 
2.7 
2.7 
2.7 
2.7 
2.7 
27 
27 
2.7 
2.7 
2.7 
27 
27 
2.7 
27 
17 
27 
27 
27 
27 
2.7 
Z7 
27 
Z7 
27 
27 
27 
27 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



423020 
445354 
427961 
410889 
445234 
413903 
406089 
447410 
401256 
415139 
420218 
455511 
438825 
409564 
452837 
434876 
421565 
453279 
430785 
456986 
433068 
421952 
429208 
430733 
441720 
418986 
432481 
434338 
417061 
410530 
456672 
425071 



451531 
405822 
418301 
417315 
434699 
443204 
405638 
452542 
403943 
404535 
402800 
449144 
454934 
424717 
428303 
427970 
450638 
453034 
455097 
427317 
408875 
427510 
423201 
406271 
442696 
454018 
435420 
434398 
455708 
439347 
407523 
425101 
435153 
409139 



AA383092 

AV653485 

AW293165 

X91662 

AW137636 

AA496493 

NA 

AI470235 
NA 

AW975942 

AW958037 

BE144762 

BE327427 

AA045857 

AL121053 

AF160477 

AK001122 

AW893940 

Z30201 

D38299 

NM_O06456 

AA300900 

AA447990 

AW975920 

AI346487 

AI123555 

AW451645 

AW754311 

AI675944 

M25809 

AKD02016 

NM_013989 

AW292286 

AA018311 

AW976201 
AI080042 
AA643687 
AW205878 

AW812256 

Z25884 
NA 

AJ989503 

AW846080 

H03754 

AW974476 

AA418187 

AK001826 

BE246010 

AW855802 

AB028955 

NM 015434 

Z47542 

NM_000163 



Hs.1608 
Hs.6390 
Hs.143134 
Hs.66744 
Hs.146059 
Hs.23136 



replication protein A3 (1 4kD) 2.7 
Homo sapiens clone FLB3344 PRO0845 mRNA, 2.7 



AW016892 

A1928513 

AA121098 

BE069326 

W24320 

X64984 

AA830431 

AA668763 

AI6B1917 



ESTs 2.7 

twist (Orosophlla) homolog (acrocephatos 2.7 

ESTs 2.7 

ESTs 2.7 

Target Exon 2.7 

Hs.172698 EST 2.7 

NM_024089*:Homo sapiens hypothetical pro 2.7 

Hs.48524 ESTs 2.7 
Hs.286 ribosoma! protein L4 "2.7 
gb:CM0-HTO180-041099-065-bO4 HT0180 Homo2.7 

Hs79953 ESTs 2.6 

Hs.54943 fracture callus 1 (raQ homolog 2.6 

Hs.5534 Homo sapiens cONA FU12961 fis, done NT 2.6 

Hs.61460 Ig superfamily receptor LNIR 2.6 

Hs.105859 hypothetical protein FU10260 2.6 

ESTs 2.6 

gb:HHEA22G Atrium cDNA library Human hea 2.6 

Hs. 170917 prostaglandin E receptor 3 (subtype EP3) 2.6 

Hs.288215 sialyltransferase 2,6 

Hs.98849 ESTs, Moderately similar to AF16 1511 1 H 2.6 

Hs.1 90478 ESTs 2.6 

Hs.283361 ESTs 2.6 

Hs.28739 ESTs 2.6 

Hs.81796 ESTs 2.6 

Hs.151504 Homo sapiens cONA FU1 1973 fis, clone HE 2.6 
gb:CM1-CT0337-14129W)68-fl)7 CT0337 Homo 16 

Hs.188691 Homo sapiens cONA FU12033 fis, done HE 2.6 

Hs.64173 ATPase, H transporting, lysosomal (vacuo 2.6 
Hs.1 14727 Homo sapiens, done M6C:16327, mRNA. com 2.6 

Hs.1 54424 deiodinase, lodothyronine, type II 2.6 

Hs.255058 ESTs 2.6 

Hs.1 14762 ESTs 2.6 

Target Exon 2.6 

Hs.53913 hypothetical protein FU10252 2.6 

Hs.336901 ribosoma] protein S24 2.6 

Hs.149425 Homo sapiens cDNA FU1 1980 lis, clone HE 2.6 

Hs.29643 Homo sapiens cDNA FU13103 fis, clone NT 2,6 

Target Exon 2.6 
gb:RC0-STO174-191099431-a07 ST0174 Homo 2.6 

C5000355:gil4503225|refjNP^000765.1| cyt 2.6 

Hs.121483 chloride channel 1 , skeletal musde (Th 2.6 

Target Exon 2.6 

Hs.233405 ESTs 2.6 

Hs.314324 ESTs 16 

Hs.1 52213 win^less^peM^lntegratransitetaml 2.6 

Hs.1 83601 regulator of G-proteln signalfing 16 2.6 

Hs.330515 ESTs 2.6 

Hs.25245 hypothefical protein FU11269 16 

Hs.271458 HomosapiensmRr^lorFUOOOSSprbteln, 2.6 
gb:RC1-CTO279-170200-O23-dO8 CT0279 Homo 16 

Hs.175780 KIAA1032 protein 2.6 

Hs.48604 DKFZP434B168 protein 2.6 

Hs.179312 small nuclear RNA activating complex, po 2.6 

Hs.125180 growth hormone receptor 2.6 

Target Exon 2.6 

Hs7063 Homo sapiens cDNA: FU20913 fis, done A 2.6 

Hs.100855 ESTs 2.6 

Hs.59203 ESTs 2.6 

Hs.3838 serum-indudble kinase 2.6 
gb:QV3-BT0381-17010(W60^03 BT0381 Homo 16 

Hs.102941 Homo sapiens cDNA: FU21531 fis, clone C 2.6 

gb-Rsaptens mRNA HTPCRX10 for olfactory 2.6 

Hs.180811 ESTs 2.6 

Hs.291939 ESTs 16 

Hs.3321 ESTs, Highly similar to IRX1 JIUMAN IROQU 2.6 
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455100 BE160198 
414612 BE274552 
440283 AI732892 
423025 AA831267 
431473 AA825686 
404440 
403388 NA 
403775 NA 
405037 NA 
407447 AF29Q544 
420952 AA282Q67 
435447 AI872932 
440202 AW516211 
445854 AI702885 
421247 BE391727 
414870 N72264 
457411 AW085961 
424676 Y08565 
404443 

452268 N&L003512 
430832 AI073913" 
444779 AI192105 
408633 AW963372 
459089 F13036 
447197 R36075 
454111 AW081681 
411465 NM_000169 
406922 S70284 
456045 H62943 
413111 BE065837 
423123 NM.012247 
432201 AI538613 
427032 AF012023 
445417 AK001058 
422225 BE245652 
428330 L22524 
410011 AB020641 
426310 NML000909 
419589 AW973708 
437770 AA767881 
442818 AK001741 
414251 AL042306 
428301 AW628666 
431956 AK002032 
455732 BE080908 
458624 AI36279Q 
428257 BE394723 
418054 MI002318 
458652 AW375610 
458012 AI424899 
422996 BE091089 
410804 U64820 
457211 AW972565 
440029 AW089705 
448141 AI471598 
409163 AA065081 
431385 BE178536 
414569 AF109298 
439963 AW247529 
455935 BE158687 
425025 AW953168 
416589 AA652687 
404826 

422938 NMJM1809 
421991 NNL014918 
417404 NMJJ07350 
448516 AW898595 



gb:QV^0413-010200-059-h03 HT0413 Homo2.6 



Hs.76578 protein inhibitor of activated STAT3 
Hs.190489 ESTs 

Hs. 12244 hypothetical protein RJ20097 

Hs.321 176 ESTs, Weakly similar to S65824 reverse t 

NMJ)21048:Homo sapiens melanoma antigen. 

C3001398*:gi|12248917ldb]lBAB20375.1j(A 

Target Exon 

NM_021628*:Homo sapiens arachidonate lip 
gb:Homo sapiens aminopeptidase mRNA, par 
Hs.88972 ESTs, Moderately similar to A46010 X-Iln 



2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
Z6 
16 



Hs.125300 
Hs.145568 
Hs.102910 
Hs.300670 
Hs.130093 
Hs.151678 

Hs.28777 

Hs.100686 

Hs.147170 

Hs.46677 

Hs.27373 

Hs.269064 
Hs.69089 

Hs.154188 

Hs.124027 

Hs.298241 

Hs.173274 

Hs.12680 

Hs.1 18281 

Hs.2256 

Hs.57856 



gbwm72e03jc1 NCLCGAP.W2 Homo sapiens 2.6 



2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 



Hs.201925 
Hs.122897 
Hs.8739 
Hs.97689 
Hs.98440 
Hs.272245 

Hs.278639 
Hs.275243 
Hs.83354 
Hs.1 17102 
Hs.188211 

Hs.66521 
Hs.32399 
Hs.293711 
Hs.197531 

Hs.1 1090 
Hs.1 18258 
Hs.6793 

Hs.12407 
Hs.96151 

Hs.1594 

Hs.110488 

Hs.82101 



ring finger protein 21 , interferon-respo 
ESTs 

general transcription factor IIH, polype 
K1AA1204 protein 
ESTs 

UDP-N-acetyl-aipha-D-galactosamine:polyp 
C8001428*:gil6572242|emb|CAB62951.1| (Z9 
H2A hlstone family, member L 
ESTs. Weakly similar to JE0350 Anterior 
ESTs 

PRO2000 protein 

Homo sapiens mRNA; cONA DKFZp56401763 (f 2.6 
gb:yh88b01.s1 Soares placenta Nb2HP Homo 2.6 
ESTs, Weakly similar to T42689 hypotheti 2.6 
galactosidase, alpha 2.6 
gbstearoyl-CoA desaturase (human, adipo 2.6 

gkRC2-BT0318-110100-012-g12 BT0318 Homo 2.6 
SEIENOPHOSPHATE SYNTHETASE ; Human seten2.6 
Transmembrane protease, serine 3 2.5 
integrin cytoplasmic domain-associated p 2 5 
Homo sapiens cDNA FU10196 fis, done HE 
zinc finger protein 266 
matrix metaUoproteinase 7 {MMP7; uterin 
PFTAIRE protein kinase 1 
neuropeptide Y receptor Y1 
Homo sapiens cDNA FU13446 fis, clone PL 
ESTs 

hypothetical protein FU10879 
VASA protein 

ESTs. Weakly similar to I38022 hypotheti 
Homo sapiens cDNA FU11170 fis, clone PL 
gb:QV1-BT0631-280200-084-h07 BT0631 Homo 2.5 
KIAA1684 protein; likely homolog of mous 2.5 
S100 calcium-binding protein A6 (calcyci 2.5 
fysyioxidase-like2 2 ^ 
hypothetical protein FU13046 similar to 2.5 
ESTs 

gb:PM4-BT0724-130400-006-c07 BT0724 Homo 2.5 



2.5 
2.5 
2.5 
2.5 
2.5 
2.5 
2.5 
2.5 
2.5 
2.5 
2.5 



25 
2.5 
2.5 
2.5 
2.5 
2.5 
2.5 
2.5 



Machado-Joseph disease (spinocerebellar 
ESTs, Weakly similar to S51797 vasodilat 
ESTs. Weakly similar to S64329 probable 
ESTs 

gb:zm13a03.s1 Stratagene pancreas (93720 
membrane-spanning 4-domains, subfamily A 
prostate cancer associated protein 1 
platelet-activating factor acetylhydrola 
gb:CM0-HT0395-280100-169-bG9 HT0395 Homo2.5 
ESTs 2.5 
Human DNA sequence from clone RP5-1 103G7 2.5 
Target Exon 2.5 
centromere protein A (17kD) 2.5 
KIAA0990 protein 2 - 5 
pleckstrin homology-like domain, family 2.5 
gb:RC1-NN0073-26040(H>11-g09 NN0073 Homo2.5 
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403356 NA EN SP00000251 525*.HypothetIcal protein Kl 25 

404983 ENSP00000252242 # :Kerafin l typelJcytosk 25 

418282 AA215535 Hs.98133 ESTs 2.5 

427409 AW467143 Hs.135411 actin related protein 25 

5 431806 AF186114 Hs.270737 tumor necrosis factor (Dgand) superfami 25 

443367 AW071349 Hs.215937 ESTs 25 

421246 AW582962 Hs.102897 CGI47 protein 25 

439217 AF086041 Hs.42975 ESTs 25 

400925 Target Bcon 25 

10 404552 NA ENSP00000220888*21NC FINGER TRANSCRIPT125 

417168 AL1331 17 Hs.81376 Homo sapiens mRNA; cDNA DKFZp586l1 121 (f 25 

418841 NMJW2332 Hs.89137 low density lipoprotein-related protein 25 

426853 U32974 Hs.172777 bacutoviral IAP repeat-conlainhg 4 25 

427738 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD. Ze 25 

15 457384 AA501760 Hs.15806 Homo sapiens mRNA; cONA DKFZp434H2Q19 (1 25 

447128 AI271898 Hs.164866 cydinK 25 
454693 AW813428 gb:MR3-ST01924)10200-21(W>5ST0192Homo25 

434657 AA641876 Hs.191840 ESTs 25 

402077 NA Target Exon 2.5 

20 400289 X07820 Hs.2258 matrix metalioproteinase 10 (MMP10; str 25 

. 409723 AW885757 Hs.257862 ESTs 2.5 

447020 T27308 Hs.16986 hypothetical protein FU 11 046 25 

455068 A1807894 Hs.47274 Homo sapiens mRNA; cDNA DKFZp564B176 (fr 25 

431232 AI024353 Hs.131755 hypothetical protein FU 14298 2.5 

25 408938 AA059013 Hs.22607 ESTs 2.5 

411571 AA122393 Hs.70811 hypothetical protein FU205 16 2.5 

42B504 AW162919 Hs,17Q\6Q RAB2, member RAS oncogene family-like 25 

428248 AI126772 Hs.40479 ESTs 2.5 

408813 AI560030 Hs.48295 RNA helicase family 25 

30 423504 N80077 Hs.24792 chromosome 12 open reading frame 5 25 

425441 AA449644 Hs.193063 Homo sapiens cDNA FU14201 fis, clone NT 25 

443066 AW297921 Hs.255703 ESTs 25 

443556 AA256769 Hs.94949 methylmalonyl-CQA epimerase 2.5 

428943 AW086180 Hs.37636 ESTs, Weakly similar to K1AA1392 protein 2.5 

35 425320 U29344 Hs.83190 fatty acid synthase 25 

430368 AA356923 Hs.240770 nuclear cap binding protein subunit 2, 2 25 

423242 AL039402 Hs.125783 DEME-6 protein 2.5 

416241 N52639 Hs.32683 ESTs 2.5 

440244 AI743977 Hs.2Q5144 ESTs 2.5 

40 409239 AA740875 Hs.44307 ESTs, Moderately similar to 138022 hypot 25 

452464 AW500507 Hs.192619 KIAA1 600 protein 2.5 

410718 A1920783 Hs,191435 ESTs 25 

408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 25 

445150 AI446747 Hs.338704 olfactory receptor, family 7, subfamily 2.5 

45 407756 AA1 16021 Hs.38260 ubiquitin specific protease 18 2.5 

407633 NM_007069 Hs.37189 similar to rat HREV1 07 25 

449754 H00820 Hs.30977 ESTs, Weakly similar to B34087 hypothetl 25 

419316 AA236255 Hs.298419 ESTs 25 

429118 H20669 Hs.35406 ESTs, Highly similar to unnamed protein 25 

50 440331 AL045412 Hs.202151 ESTs 2.5 • 

449344 AI640355 Hs.312691 ESTs 25 

459006 AW298631 Hs.27721 Wolf-Hirschhom syndrome candidate 1-fik 2.5 

423165 AI937547 Hs.124915 hypothetical protein MGC2601 25 
411337 AW837349 gb:QV2-LT0038-270300-10Wl2LT0038Homo 2.5 

55 438290 AA843719 Hs.122341 ESTs 25 

406414 C5000506*:gij124941 |sp|P186l4|ITA1_RAT I 25 

424498 AB033043 Hs.149377 hypothetical protein DKFZp761L0424 25 

443464 BE548446 Hs.5167 Homo sapiens mRNA; cDNA DKFZp434F1 52 (fr 25 

424856 AA347746 Hs.9521 ESTs, Weakly similar to ZN43_HUMAN ZiNC 25 

60 440304 BE159984 Hs.125395 ESTs 25 

409045 AA635062 Hs.50094 Homo sapiens mRNA; cDNA DKFZp434O0515 (f 25 

422648 D86983 Hs.1 18893 Melanoma associated gene 25 

428819 AL135623 Hs.193914 KIAA0575 gene product 25 

412520 AA442324 Hs.795 H2Ahistone family, member O 25 

65 430602 D13752 Hs.1 84927 cytochrome P450, subfamily XIB {steroid 2.5 

408031 AA081395 Hs.42173 Homo sapiens cONA FU10366 fis, clone NT 25 

403133 Target Exon 25 
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413189 BE070231 gb:QV4-BT0407-260100-QB7-H2 BT04Q7 Homo 2.5 

400346 AB041269 Hs.272263 Homo sapiens mRNA for keratin 19, partia 2.5 

435509 AW58679 Hs.181915 ESTs 2.5 

458145 AI239457 Hs.130794 ESTs « 
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TABLE 19A 



40 



45 



50 



55 



60 



Table 19A shows the accession numbers for those pkeys lacking unigenelD's for Table 19. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: 

CAT number 
Accession: 



Pkey 

407647 
407980 
408254 



409695 
410534 
410672 
410784 
410785 
410835 
411050 
411086 
411093 
411111 
411171 
411337 
411514 
411670 
411905 
412102 
412209 
412248 
413043 
413111 
413189 
413221 
413499 
413708 
414210 
414596 



1007366.1 

103087J 

1049346J 



10 



15 



20 



25 



30 



35 409163 110418J 



Unique Eos probeset identifier number 
Gene duster number 
Genbank accession numbers 



CAT number Accessions 



114876 J 

1207247J 

1214882.1 

1221005.1 

1221055.1 

1223785.1 

1230330 1 

1231500.1 

1231970.1 

1232669.1 

1234393.1 

1239217.1 

1248638.1 

1253680.1 

1265181.1 

1277395.1 

1283610 1 

1285000.1 

1346556 1 

1349546.1 

1352723.1 

1353887 1 

1373910.1 

1384140.1 

1426051.1 

1465004.1 



AW860158 AW862385 AW860159 AW862386 AW862341 AW821869 AW821893 AW062660 A W062656 
AA046309 AI263500 AA046397 

AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AW807417 AW807083 AW845786 AW845801 
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424 
AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390 
AW807395 AW845789 AW807101 AW807089 AW807519 AW807239 AW807509 AW807356 AW807526 AW807098 
AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AW807166 AW807501 
AW807120 AW807168 AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800 
AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141 
AW807522 AW807487 AW807514 AW807 1 42 AW8 07232 AW807379 AW8071 14 AW807518 AW807199 AW80721 1 
AW807498AW807086AW807492AW807218AW807082AW807525AW807493AW807523AW807087AW845784 
AW807037 AW807128 AW807080 AW807118 AW845807 AW807524 AW845803 AW807249 AW845795 AW807160 
AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103 
AW845870 AW177099 AW177101 AW807528 AW807336 AW807038 AW177100 AW807411 AW807088 AW845865 
AW807226 AW80751 7 AW807397 AW807303 AW8071 77 AW807154 AW807136 AW807146 AW807085 AW807521 
AW807488 AW807385 AW807355 AW807223 AW807155 

AA065081 AA075017 AA084791 AA071015 AA081560 AA071459 AA545727 AA083100 AA085366 AA115845 AA075457 

AA064704 AA082878 AA075742 M069162 

AA296961 AA296889 AA076945 AA077528 AA077497 

AW905138 AW753008 R13818 243519 

AW794600AW794730 

AW803201 BE079700 BE062940 

AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

AW806906 AW8069 1 5 AW866460 AW866475 AW866462 AWB66448 AW866372 AW866604 

AW814902 BE156656 BE156667 BE156590 BE156441 BE156447 

BE070800 AW875226 BE149115 

BE067650AW817053 

AW818127AW818161R09719 

AW820260AW820332R94406 

AW837349 AW837355 AW882717 

AW850178 AW850233 AW850445 AW850446 

AW856552 AW861 101 AW856574 AW861099 AW861 1 00 AW856573 AW856576 AW856562 

BE265067 BE264978 AW875420 

H56435H56572AW892929 

AW901456 AW901450 AW901441 

BE176480 AW903298 AW903313 

BE158766 BE061699 BE147360 BE147362 BE061666 BE061697 BE061647 BE061678 
BE065837 BE065805 BE065799 BE065818 BE065839 BE065831 BE065894 BE065789 BE065792 
BEO70231 BE070229BE070255 

BE161151 BE162495 BE161002 BE072205 BE160989 BE162482 
BE144884 H97942 

BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE158811 BE158700 BE158741 BE158683 8E158685 

BE383592BE261671 

8E386870 Z41986 H08501 
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65 



414605 
415747 
416173 
417742 
417974 
418636 
419536 
420654 
422156 

422996 

423833 

423841 

423945 

425201 

426650 

426878 

430264 

430785 

431676 

433687 

434338 

434469 

435447 

437152 

437854 

439031 

439255 

444910 

445432 

446922 

447197 

448420 

448516 

450522 

450736 

451024 

451067 

451340 

452542 

452564 

453472 

454307 

454359 

454545 

454693 

454714 

455047 

455092 

455097 

455100 

455431 

455511 

455609 

455651 

455685 
455700 
455708 
455732 



455935 
455945 
456207 
456482 
458094 . 
458673 



1465790.-1 

155189J 

1574973J 

1696282J 

171237.1 

177402J 

185688.1 

197072J 

212379J 

223666J 

232451.1 

232507J 

233566J 

247933.1 

270283.1 

273265.1 

315008.1 

323486J 

336411J 

373061.1 

383982.1 

387447.1 

406400.1 

43386.1 

44418.1 

46798.1 

470321.1 

624951.1 

63943.1 

69865.1 

711623.1 

76273.1 

766241.1 

837264.1 

844652.1 

85565.1 

85759.1 

86640.1 

921410.1 

92227.1 

968371.1 

1106070. 

1130674, 

1223779. 

1229132.' 

1230493. 

1250536. 

1252971. 

1253130. 

1253334. 

1289854. 

1321229. 

1337548 

1348732. 

1350393.' 
1351264. 
1352232." 
1353874. 
1374605. 
1384144. 
1385588. 
165078.- 
192289J 
47311.1 
679507.1 



BE390440 

AA381209AA381245M167683 
R52782 R17313 H24192 R19876 
R64719 Z44680R12451 
AA210765T95700 H94407 
AW749855 AA225995 AW750208 AW750206 
AA603305 AA244095 AA244183 

AW835572 AW385512 AI334966 W32951 H62656 H53902 R88904 AWB35732 

BE091089BE091123AA319959 

AW503329 N46610AA331571 

AW753967 AA370795 AA331630 AW962550 

AA410943 AW948953 AA334202 AA332882 

AA352111 AW962247 AA429695 

AA382814 AA402411 AA412355 

BE069341 AW748403 AL044891 A1908240 AA393080 

AA470519 BE303010 BE302954 BE384120 

Z30201M486132T72025 

A1685464 AW971336 AA513587 AA525142 

AA743991 M604852 AW272737 

AW754311 AA630185 AW803285 

AA634806 C18732 AA729161 AA729860 

AI872932 AA682306 BE220163 W88695 T81307 H91447 

AL050027BE089051 

AL119723 AL119874 AI909018 U50537 

AF075079 H48601 H48795 

BE164500AA832198BE164502 

AI201849 BE069007 AW946544 

AV653771 BE089370 

BE175605 Z43529 F06610 BE175602 AV661027 
R36075 AI366546 R36167 
BE623004 AA360669 BE263527 BE246433 



AW970060 AI732366 AI792313 AW839644 
AA442176AA259181 

BE172186 AA059279 AA020815 AA013437 
AW936273 AW340350 AA017208 
AW812256 AW812257 AI906423 AI906422 
AA026777 N50065 R09961 N54721 
AUJ37925 AL037931 AL037957 
AW855717 AW362452 AW362443 
N71277AW390764 

AW806899 AW866451 AW866393 AW866297 AW817869 
AW813428 AW813444 AW813367 AW813368 AW813429 AW813424 
AW815098BE154843BE154831 
AW852530 AW852527 AW852526 

!^!^ 

BE160198AW935898T11520AW935930AW856073AW861034 
AW938484BE001245BE001190 

rIS BEoTl 9 6 0 8 9 9BE011627 BE011679 BE011699 BE011678 BE011696 BE011675 BE011622 BE011635 
BE064962 lo64979 BE064853 BE064857 BE064856 BE064977 BE064960 BE064860 BE06481 5 BE064957 BE064804 

BE066976 BE066928 BE066927 

BE068115 BE068104 BE068102 BE068096 BE068103 BE068154 BE068198 

BE069326 BE069290 BE069352 

BE080908 BE072258 BE072190 BE072236 

BE145808BE145807BE181883 

BE158687BE158688 

BE160636 BE160606 BE160703 

AA193450 

AA485224 AA287308 AA258121 
AF086325 W72956 W73221 AA219112 
N99626AI3027O1 
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TABLE 19B 



Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Idenfifler (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The ONA sequence of human chromosome 22/ Dunham I. et al., Nature (1999) 402:48^495. 
Strand: Indicates DMA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


Nt_position 






Minus 


134694-134817 


40flfi0fl 
tuuuvo 


300/ uuu 


Minus 


96756-97558 




QRR7R71 


Minus 


117606-117928,124040-124147 


4nftQ9f 


/ oj tut i 


Pius 




401045 


8117619 

Or If UI9 


Plus 


90044-90184 91111-91345 




7919177 


Pit IS 


149157-150692 


401093 


8516137 


Minus 


22335-23166 


401256 


9796573 


Minus 


45482-45620 


401233 


9800093 


Minus 


4725647456 


401326 


9212516 


Minus 


226246-227505 


401418 


7452889 


Minus 


124865-125075 


401451 


6634068 


Minus 


119926-121272 


401458 


9187886 


Pius 


76485-77597 


401497 


7381770 


Plus 


92607-92813 


401508 


7534110 


Minus 


110779-110983 


401575 


7229804 


Minus 


76253-76364 


401747 


9789672 


Minus 


118596-118816.119119-119244,119609-119761.120422-120990,130161-130381.130468-130593.131097- 








131258,131866-131932,132451-132575.133580-134011 


401781 


7249190 


Minus 


83215^83435.83531-83656.83740-83901,84237^84393.84955^85037,86290-66814 


401785 


7249190 


Minus 


165776-1 65996.166 189-16631 4.166408-1^569.1671 12-167268,1 67387-167469. 1 68634-1 68942 


401793 


7263888 


Minus 


102945-103083 


401987 


4406829 


Minus 


72893-73021,76938-77049 


402077 


8117414 


Plus 


65014-65195 


402109 


8131678 


Minus 


171722-171859.173197-173303 


402184 


8576001 


Minus 


112844-112986,113505-113636 


402376 


9625329 


Minus 


21753-22385 


402421 


9796341 


Minus 


46609-46662,46758-468 1 1 .86293-86346.89776^9829,90048-90101 .102817-1 02924 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Minus 


81747-82094 


402696 


7328818 


Minus 


23600-23731 


402765 


9367757 


Plus 


109588-109726 


402800 


6010175 


Plus 


43921-44049,4618146273 


402820 


6456853 


Minus 


82274-82443 


402892 


8086844 


Minus 


194384-194645 


403133 


7331427 


Plus 


3831448634 


403356 


8569930 


Plus 


92839-93036 


403388 


9438331 


Plus 


112733-113001,114599-114735 


403426 


9719529 


Minus 


157156-158183 


403585 


8101208 


Minus 


131266-131769 


403593 


6862650 


Minus 


62554-62712,69449-69602 


403637 


8671936 


Minus 


142647-142771,145531-145762 


403639 


8671948 


Plus 


113234.113326,115166-115287.119649-119786 


403677 


7331517 


Minus 


55008-55083.62860-63051 


403775 


7770580 


Minus 


102247-102326,103095-103148 


403943 


7711864 


Plus 


100742-100904.101322-101503 
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404091 
404097 
404142 
404253 
404274 
404285 
404360 
404440 
404443 
404552 
404561 
404580 
404721 
404826 
404983 
405037 
405041 
405095 
405153 
405196 
405232 
405248 
405336 
405394 
405460 
405494 
405547 



405638 
405654 
405718 
405822 
405848 
405873 
405906 
405917 
405925 
405953 
406069 
406151 
406153 
406182 
406271 
406291 
406348 
406414 
406446 
406504 
406554 



7684554 

7770701 

9856692 

9367202 

9885189 

2326514 

9858450 

7528051 

7579073 

7243881 

9795980 

6539738 

9856648 

6572184 

4432779 

7543748 

7547195 

8072599 

9965565 

7230083 

7249042 

7259728 

6094635 

6624123 

7684569 

8050952 

1054740 

5757553 

6289229 
4895155 
9795467 
6273498 
7651809 
6758747 
7705124 
7712162 
6758795 
7960374 
9117732 
7144806 
9929734 
5923650 
7534217 
5686274 
9255985 
9256407 
9454509 
7711360 
7711566 



Minus 82121-83229 

Plus 55512-55781 

Minus 80316-80459 

Minus 55675-56055 

Plus 104127-104318 

Plus 32282-32416 

Minus 122873-122966,151324-151469,153093-153253 

Plus 60430-81581 

Minus 87198-87441 

Plus 19854-20010 

Minus 69039-70100 

Minus 240588-241589 

Minus 173763-174294 

Plus 47726-48046 

Minus 51 178-51374.52000-52173 

Minus 127374-127578 

Pius 121230-121714 

Plus 138877-139066 

Minus 175317-175500 

Minus 135716-135851 

Plus 125904-126063 

Plus 637-777 

Plus 33267-33563 

Minus 31900-32373 

Minus 52223-52389 

Minus 70284-70518 

MirL «KloK 

52918.55469^5601,57111^7307,58169-58296,60215-60332,61482-61727 

Pius 199260-199372.199826-199929 

Minus 53624-63759 

Plus 113080-113266 

Minus 154660-154974,155203-155379 

Minus 28135-28244 

Minus 32129-32764 

Minus 10835-11059 

Minus 106829-107213 

Plus 129935-130282 

Minus 65101-65574 

Plus 68880-69374 

Minus 94087-94285 

Minus 12902-13069 

Minus 28256-28935 

Plus 36179-36692 

Plus 9562-9867 

Minus 71754-71944 

Plus 4959349850 

Minus 116424-116527,118721-118859.121187-121364 

Minus 107068-107277 

Plus 106956-107121 
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TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
SURFACE PROTEINS 



10 



15 



20 



Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
tissue level was set to the 85 th percentile value amongst 144 non-malignant tissues, and the 
96 th percentile value amongst the 73 breast cancer specimens was greater than or equal 100 
units, and the predicted protein contained a structural domain that is indicative of 
extracellular localization (e.g. ig, fii3, egf, 7tm domains, signal sequences, transmembrane 
domains). The predicted protein domains are noted. 



Pkey: Unique Eos prabeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unkjene number 

Pred.ProtDomains: Predicted Protein Domains 

Unfcene Title: Unigene gene title 

R1 : Ratio of 93* percentile of tumor to percentile of normal body tissue 



25 Pkey ExAccn UnigenelD Pred.Prot.Domains 
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408591 
400291 
449746 
407277 
400292 
424735 
426878 
428848 
407178 
408000 
427585 
429441 
450375 
420931 
422109 
445730 
451110 
400297 
420813 
452744 
424634 
420757 



447350 
456207 
431448 
427217 
455938 
435496 
402578 
453160 
422505 
444342 



AF015224 Hs.46452 
AA401369 Hs.190721 
A1668594 Hs.176588 
AW170035 Hs.326736 
AA250737 Hs.72472 
U31875 Hs.272499 
BE069341 

Nf^000230Hs.194236 
AA195651 Hs.104106 
L11690 Hs.620 
D31152 Hs.179729 
AJ224172 Hs.204096 
AA009647 Hs.8850 
AF044197 Hs.100431 
S73265 Hs.1473 
AI624342 Hs.170Q42 
AJ955040 Hs.265398 
AI127076 Hs.334473 
X51501 Hs.99949 
AI267652 Hs.30504 
NNL003613HS.151407 
X78592 Hs.99915 
AI905687 Hs.2533 
AI375572 Hs.172634 
AA193450 

AL137517 Hs.334473 
AA399272 Hs.144341 
X52509 Hs.161640 
AW840171 Hs.265398 

AI263307 Hs.239884 
AL120862 Hs.124165 
NM_014398Hs.10887 



SS.Uteroglobin,SS.Uteroglobln 
TM 

,SS,p450 
TM 

death,ZU5.TM,Activin_recp.pkinase I 
,SS,7M 
TM 

SS.Leptin.SS.teptin, 
,S$,Dihydroorotase, 

Plectin.repeat.SH3 1 spectrin,SS 1 Plectin_r 
SS,C1q,Collagen,SS ( C1q,Coilagen, 
.SS.Uteroglobin. 

,SS l TM ( dislntegrin,Pep^M12B_propep,Repro a disintegrln and metalloprotelnase doma 



UnigeneTitle 

mammaglobin 1 
ESTs 

ESTs. Weakly similar to CP4Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-8R 
BMP-R1B 

shorVchain alcohol dehydrogenase family 
gb:QV3-8T0381-27010(W)73-c08BT0381 Homo 
leptin (murine obesity homolog) 
ESTs 

bullous pemphigoid antigen 1 (230/240kD) 
collagen, type X, alpha 1 (Schmid metaph 
lipophifin B (uteroglobin family member) 



SS.IU.SS 
SS.Bombesin.SS 
,SS,TM,Cation_efflux 
SS 
TM 
SS SS 

,SS,TM,GNSLSUR4 1 cNMP_binding,Rila 
ig,tspJ,SS,AAA 

hormone jec,Androgenjecep,zf-C4, 
SS 



,SS,TM,p450,p450 
TM 
SS 



small Inducible cytokine B subfamily (Cy 
gastrin-releasing peptide 
ESTs 

ESTs. Weakly similar to transformafion-f 
hypothetical protein DKFZp56401278 
prolactin-induced protein 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 
cartilage intermediate layer protein, nu 
androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gb2r40eQ7.r1 Soares_NhHMPu_S1 Homosapl 
hypothetical protein DKFZp56401278 
ESTs 



,SS.TM,aminotrai^1J,Cadhenn_C_term,cad tyrosine aminotransferase 

SS ESTs, Weakly similar to transformatiorK 

SS.p450,SS,TM.p450 C1001134:gi|2117372lpiriP65981 fatty ac 

SS H2B histone family, member L 

SS programmed ceil death 9 (PDCD9) 

Lamp.SS.TM.Lamp, similar to rysosorne-assoctated membrane 

304 



R1 

168.6 

73.2 

65.7 

57.6 

55.9 

53.8 

50.3 

40.8 

39.3 

37.3 

35.2 

30.0 

25.7 

25.2 

24.6 

24.1 

24.0 

23.8 

22.8 

22.6 

21.7 

21.7 

20.3 

19.2 

18.3 

18.2 

18.2 

18.1 

17.9 

17.8 

17.8 

17.7 

17.5 
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Hs.137476 
Hs.21572 
Hs.57471 
Hs.83169 



449765 N92293 Hs.206832 
428227 AA321649 Hs.2248 
425692 D90041 Hs.155956 
424001 W67B83 
448595 AB014544 
449448 D60730 
418007 M13509 
418994 AA296520 Hs.89546 
453596 AA441838 Hs.62905 
452401 NM 007115HS.29352 
446591 H44186 Hs.15456 
419296 AA236115 Hs.120785 
452838 U65011 Hs.30743 
422805 AA436989 Hs.121017 
448390 AL035414 Hs.21068 
447342 A1199268 
411869 W20027 
443709 A1082692 
459587 AA031956 
442580 AI733682 
400289 X07820 
411598 BE336654 Hs.70937 
415263 AA948033 Hs.130853 
433805 AA706910 Hs. 112742 
407276 AI951118 Hs.326736 
443348 AW873596 Hs.182278 
421037 AI684808 Hs.197653 
424086 AI351010 
400295 W72838 
452461 N78223 
427365 AI873274 
433365 AF026944 
409269 AA576953 
432596 AJ224741 
408771 AW732573 Hs.47584 
432912 BE007371 Hs.200313 
447033 AI357412 Hs.157601 
421155 H87879 Hs.102267 
424905 NM_002497Hs.153704 
425398 AL049689 Hs.156369 
438167 R28363 Hs.24286 
459583 A1907673 
423945 AA410943 
439820 AL360204 Hs.283853 
402606 

445263 H57646 
430217 N47863 
447164 AF026941 
431385 BE178536 Hs.11090 
423887 AL080207 Hs.134585 
415385 R17798 Hs.7535 
425704 U79293 Hs.159264 
429859 NM.007050HS.225952 
425523 AB007948 Hs.158244 
428368 BE440042 Hs.83326 
418912 NMJ)00685Hs,89472 
422026 U80736 Hs.1 10826 
451952 AL120173 Hs.301663 
438199 AW016531 Hs,122147 
400608 

413472 BE242870 Hs.75379 
432374 W68815 Hs.301885 
402408 NA 

445537 AJ245671 Hs.12844 
451621 AI879148 Hs.26770 
405654 NA 

434988 AJ418055 Hs.161160 



Hs.19322 
Hs.23439 
Hs.134662 

Hs.130239 
Hs.2258 



Hs.102267 

Hs.2533 

Hs.108106 

Hs.190721 

Hs.293797 

Hs.22972 

Hs.278461 



Hs.42586 
Hs.336901 
Hs.17518 



SS 

SS.1L8, 

,SS,Acetyltransf2, 
,pWnase, 

LRRCT l LRR,SS J LRRCT,serine.carbpept 
SS 



ESTs, Moderately similar to ALU8.HUMAN A 
small Inducible cytokine subfamily B (CX 
N-acetyitransferase 1 (arytamlne N-acety 



KIAA0644 gene product 
ESTs 



$S,hemopexin,PepiidaseJVI10 ) SS,P^ 1 (MMP1; inters 

SS!ectin_c l sushl 1 EGF > SsiEGF,tectiruc,su selectin E {endothelial adhesion molecul 
SS hypothetical protein RJ14834 

,SS,CUB,Xlink, tumor necrosis factor, alpha-induced pro 

PD2.SS PDZ domain containing 1 

SS ' ESTs 

Ss t sS preferentially expressed antigen in meta 

histone.SS,hlstone,hlstone H2A histone family, member A 

SS hypothetical protein 

.SS.Iipocalin Homo sapiens, Similar to RIKEN cONA 2010 

lsslPeptidase_M1, ESTs 

.SS.TM.SNF ESTs 

|SS|UM( gkzk15eQ4.s1 Soaresj)regnantuterus_NbH 

SS * ' ESTs 

hemopexin l Peptidase - M10,SS l Peptidase_M10matrix metalloproteinase 10 (MMP10; str 
histone,Ss!histone,his1one m hktnnfl famflv member A 

f SS,hfetone,histoneMerJiistone 
,SS,RtoosomaLL7Ae, 
TM 

,SS,DENN 
SS 

,SS,Lysyl_oxidase 
SS 

,SS,G9a,PHD, 
TM 

.SS.TPR 

$SJM,UPFu016,S3 t Ttf,uPrG016 
SS.EGF.vwa.SSJM.vwa, 
TM,K^tetra,k>rctrans,SS,TM,K_tetra,ionJ 
,SS,TM,Folate_carrier 
SS 



H3 histone family, member A 
ESTs 
ESTs 

Homo sapiens breast cancer antigen NY-BR 
calmodulin 2 (phosphorylase kinase, deJt 
programmed cell death 9 (PDCD9) 
iysyl oxidase 

aldehyde dehydrogenase 9 family, member 

transcription factor 

ESTs 

ESTs 

hypofce&a! protein FU13352 
matrilin 3 

potassium vollage-gated channel, delayed 
ESTs 
ESTs 



SS,LysyLoxldase/ldose_epim,Epimerase,S Iysyl oxidase 
pkinase,SS,TM,pkinase,po)yprenyLsynt, NIMA (never in mitosisj gene ablated k 



hypothetical protein similar to tenascin 
ESTs 

gb:lL-BT152-080399-004 BT152 Homo sapien 
gb:zt32h03.r1 Soares ovary tumor NbHOT H 
Homo sapiens mRNA full length insert cDN 
NMJ)24626:Homo sapiens hypothetical prot 
w »d.,^ KIAA1560 protein 

1 SS,Rr^oLA,RNA4)oLA2.RibosomaLS24e,ribosomal protein S24 



SS 

,SS,™,7tnU.p450,rrm 
.pkinase, 

death,ZU5,TM,Activin_recp > pkinase, 
SS 
SS 



JM.1BR 
,SS,TM 

,SS.TM,BRCT,anKABC_tran,ABC_tran 
.SSforkJiead, 
SS 

,SS,TM,Yj)hosphatase ) MAM,fn3 # 
l SS,laminiruB,larninin_EGF,laminin^Ntemi 



Homo sapiens cig5 mRNA, partial sequence 
membrane-spanning 4-domains, subfamily A 
DKFZP434G232 protein 
COBYV-fike protein 

Human done 23948 mRNA sequence 
protein tyrosine phosphatase, receptor t 
K1AA0479 protein 



SS.PeptidaseJWQ.hemopexin.SS.Peptldasejriatrix metalloproteinase 3 (stromelysm 



SS 1 TM,7foU.SS,TM l 7lmJ. 
SS 

.SS.pkinase, 
,SS,ArfGap f 
SS,TM,SS,TM 
SS 
SS 

,SS,carb_anhydrase 
,SS,TM,ras 
SS.npocarm.Upocalin, 
BTB.SS 
SS 



angiotensin receptor 1 
trinucleotide repeat containing 9 
ESTs 
ESTs 

C10001899:gii7508633[plrl|T253g2hypothe 
solute carrier family 1 (glial high affi 
Homo sapiens cDNA FU1 1346 fts, clone PL 
Nf4J)30920*:Homo sapiens hypothetical pro 
EGF-like-domain, multiple 6 (EGFL6) 
fatty acid binding protein 7, brain 
C12001521:gij7513934lplrj[T31081 cca3 pr 
ESTs 



17.3 

17.0 

16.7 

16.5 

16.3 

16.2 

15.7 

15.5 

15.5 

15.0 

14.9 

14.8 

14.7 

14.3 

14.2 

14.2 

13.9 

13.7 

13.7 

13.5 

13.5 

13.3 

13.2 
13.1 
13.1 
13.0 
12.9 
12.8 
12.7 
12.5 
12.4 
12.3 
12.0 
11.9 
11.9 
11.9 
11.8 
11.8 
11.7 
11.6 
115 
11.5 
114 
114 
11.3 
112 
111 
111 
111 
10.9 
10.9 
10.7 
10.4 
10.3 

m 

10.3 

10.3 

10.3 

10.2 

10.1 

10.0 

9.9 

9.8 

9.7 

9.6 

9.6 

9.6 
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416220 
431808 
414142 
400298 
418601 
415539 
421451 
429432 
442441 
426429 
406687 
448693 
419948 
426214 
427718 
414812 
400285 
422330 
416182 
420077 
452281 
434531 
408380 
443788 
411078 
445495 
433426 
424871 
426215 
409045 
435525 
409203 
424902 
431725 
418092 
439840 
427811 
420807 
426320 
447078 
415786 
410102 
404347 
433687 
421373 
422634 
453310 
435957 
407771 
443646 
446142 
444649 
435147 
439138 
429220 
428804 
453511 
439809 
414869 
416276 
452862 
452926 
453331 
420802 
450603 
422867 
418004 



N49776 Hs.170994 
M30703 Hs.270833 
AW368397 Hs.150042 
AA032279 Hs.61635 
AA279490 Hs.86368 
A1733881 Hs.72472 
AA291377 Hs.50831 
AI678059 Hs.202676 



AI820662 

X73114 

M31126 



Hs.129598 
Hs.169849 
Hs.272620 



Hs!l80884 SS l 2a_carbOpept,Propep_M14,SS l Propep_M14cart)oxypepfidase B1 (tissue) 



AW004854 Hs.228320 
AB041035 Hs.93847 
H59846 Hs.128355 
AJ798680 Hs.25933 
X72755 Hs.77367 
NA 

D30783 Hs.115263 
NMJ)04354Hs.79069 
AW512260 Hs.87767 
T93500 Hs.28792 
AA642007 Hs.1 16369 
AF123050 Hs.44532 
AI732643 Hs.144151 
AI222020 Hs.182364 
BE622641 Hs.38489 
H69125 Hs.133525 
NM_004525Hs.153595 
AW963419 Hs.155223 
AA635062 Hs.50094 
AI831297 Hs.123310 
AA780473 Hs.687 
NM_003866Hs.153687 
X65724 Hs.2839 
R45154 Hs.106604 
AW449211 Hs.105445 
M81057 

AA280627 Hs.57846 
W47595 Hs.169300 
AW885727 Hs.301570 
AW419196 Hs.257924 
AW248508 Hs.279727 

AA743991 

AA808229 Hs.167771 
NM_016010Hs.1 18821 
X70697 Hs.553 
N39015 Hs.190368 
AL138272 Hs.62713 
AI085198 Hs.164226 
AI754693 Hs.145968 
AW207523 Hs.197628 
AL133731 Hs.4774 
AI742605 Hs.193696 
AW207206 Hs.136319 
AK000713 Hs.193736 
AL031224 Hs.33102 
R41396 Hs.101774 
AA157291 HsJ21479 
U41060 Hs.79136 
AW378065 Hs.8687 
AI742170 Hs.31297 
AI240665 Hs.8895 
U22376 Hs.1334 
R43646 Hs.12422 
L32137 Hs.1584 
U37519 Hs.87539 



,SS,TM 

SS ( TM.EGF,SS 

.SS.UDPGT 

TM 

SS,TM.calreticu!in 1 SS,TM,calreticu!in, 

deathius.THActivin.recp.pkinase, 

TM 

SS 

ss 

,SS,TM,m3jg, 



hypothetical protein MGC10946 

amphiregulin (schwannoma-derived growth 

Homo sapiens cONA RJ14438 fis. done HE 

six transmembrane epithelial antigen of 

calmegin 

BMP-RIB 

ESTs 

synaplonemal complex protein 2 
ESTs 

myosin-blnding protein C, slow-type 



SS.Peplidase^MIO.hemopexin.SS.PeplIdase.matrix metailoproteinase 11 (MMP1 1; stro 



SS 

Ferric reductTM,Fenic_reduct 
SS 

l SS,TM,histone,Sec1 l histone,sugarJr 
SS.IL6.SS.1L8 

,TMABC_tran^BC_mernbrane, 

SS.TM.EGF.SS.TM 

cydln.SS 

SS 

,SS,TGF-beta,TGFb_propeptide, 
SS 

,SS,TM,ubIquitin,7tmJ,ANF._receptQr,sush 
TM 
SS,SS 



hypothetical protein FU23537 
NM 016931:Homo sapiens NADPH oxidase 4 ( 
ESTs, Moderately similar to ALU7_HUMAN A 
ESTs 

monokine induced by gamma Interferon 
Eos Control 



cydin G2 
ESTs 

Homo sapiens cONA FU1 1041 lis. clone PL 

ESTs 

diubkptin 

ESTs 

CocoaCrisp 



Ssiss < ENTH,LLWEQ,ENTH,l_LWEQ,DNAjnisjeESTs 1 Weakly similar to I38022 hypotheti 



,SS,TM 

SS.EGF,ULrecepLa.ldLreceptb,SS,TM,E 

SS 

TM 

TM 

SS,p450 P SS,p450 
SS.SS 

SS,CysJcnot,SS 
< death.ZU5.pktee.Activin_recp l 
SS 



ESTs 

tow density lipoprotein-related protein 
stanniocalctn 2 

Homo sapiens mRNA; cONA 0KFZp434O0515 (f 
ESTs 

cytochrome P450, subfamily IVB, polypept 
inositol poryphosphate-4-phosphatase, ty 
Nome disease (pseudogtioma) 
ESTs 

GDNF family receptor alpha 1 



SS,cpn10 
SS.TGF-beta.TGFbjwpeptide.SS 
.SS.kazal 
SS 
SS 
SS 
TM 

,SS f IMPDH - C.IMPDH_N J CBS 
SS 

TM.SNF.SS.TM.SNF. 
,SS,TM 

l TM,cpn60 - .TCPt l Sema, 
JSPN.vwc.tspJ^GF.thtored, 
l TM,cadhenn,Cadnerin_C_term, 
,SS,rrm, 

JM.SOF.UPARJ.Ye, 

TM 

SS 

.SS.UDPGT 

ss.ss 

SS 
SS 

SS.TMJM 

,SS^epJ^12B_propep l Reprofysm,tspJ 
.SSJM 



ESTs 

transforming growth factor, beta 2 
ESTs 

hypothetical protein FU13782 

Homo sapiens cONA FU 14035 fis, clone HE 

Target Exon 

gfr.rry57g01.s1 NCLCGAP_Pr18 Homo sapiens 
ESTs 

CGI-62 protein 

solute carrier family 6 (neurotransmitte 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA OKFZp761C1712 (f 

ESTs 

ESTs 

hypothetical protein FU20706 
transcription factor AP-2 beta (activatl 
hypothetical protein FU23045 
ublnudein 1 

UV-1 protein, estrogen regulated 
ESTs 

duodenal cytochrome b 



,SS,TKldislntegrinPepjyl12B4>ropep.Repro ESTs 
SSMmyb^DNA-bindlng v-myb avian myeloblastosis viral oncogen 

SS ~ • ESTs 

SS.EGF,tsp_3,SS,E2F JTDP, cartilage olgomeric matrix protein (COM 

Ss!aldedh 4 SS,atdedh, aldehyde dehydrogenase 3 family, member 
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9.5 
9.5 
9.4 
9.4 
9.4 
9.4 
9.3 
9.3 
9.1 
9.1 
9.1 
9.1 
9.1 
9.0 
8.9 
8.8 
8.8 
8.8 
8.8 
6.7 
8.7 
8.6 
8.6 
8.6 
8.5 
8.5 
8.5 
8.4 
8.4 
8.4 
8.3 
8.3 
8.3 
8.3 
8.3 
8.2 
8.2 
8.2 
8.2 
8.1 
8.1 
8.0 
8.0 
8.0 
8.0 
7.9 
7.9 
7.8 
7.8 
7.8 
7.7 
7.6 
7.6 
7.6 
7.6 
7.5 
7.5 
7.5 
7.5 
7.5 
7.4 
7.4 
7.3 
7.3 
7.2 
7.2 
7.2 
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426451 A1908165 Hs.169946 
450701 H39960 Hs.288467 
419519 A1196719 Hs.176376 
410555 U92649 Hs.64311 
433138 AB029496 Hs.59729 
411558 AA102670 Hs.70725 
409079 W87707 Hs.82065 
417275 X63578 Hs.295449 
432731 R31178 Hs.287820 
442618 AK001741 Hs.8739 
407366 AF026942 
427427 AF077345 Hs.177936 
410785 AW803341 

401045 ■-- , 

C1 1001883*:gil6753278|reflNP_033938.1| c 
418986 AI123555 Hs.81796 t SS,Reprolysin,tspJ, 
442082 R41823 Hs7413 
442861 AA243837 Hs.57787 
418836 A1655499 Hs.161712 
422060 R20893 Hs.325823 
444381 BE387335 Hs.283713 
404091 NA 

417866 AW067903 Hs-82772 
428819 AL135623 Hs.193914 
410275 U85658 Hs.61796 
425236 AW067800 Hs.155223 
415669 NM.005025HS.78589 
416319 AI815601 Hs.79197 
412140 AA219691 Hs73625 
442942 AW167087 Hs.131562 
446163 AA026880 Hs.25252 
443162 T49951 Hs.9029 
409602 W26713 Hs.256972 
Hs.184572 



SS.GATA, 
,SS,LRR 
SS 

,TM,dislntegrin.Reprolysin, 
$S f ig,Sema,SS,Sema,efhand 
$S,TM,SS,TM 
,TM,fh3 ( 

SS.efhand.SS.efhand.ras 
.SSMfoWMfnl 
WD40.SS 
.TM.IBR 
SS,lectin_c,SS 
SS 

ICE_p20,SS.ICEj)10 l !CE_p20 l ICE_p10,lCE_p 

~ — " 6.7 



GATA-blndlng protein 3 (T-cell receptor 
Homo sapiens cDNA RJ12280 fis, done MA 
ESTs 

a dlsintegrin and metalloproteinase doma 
semaphorin sem2 

ganma-aminobutyric acid (GABA) A recepto 
interieukin 6 signal transducer (gp130, 
parvatbumln 
flbronectin 1 

hypothetical protein OJ10879 
gb:Homo sapiens dg33 mRNA, partial sequ 
ESTs 

gb:IL2-UM0079-09030(W5(W)03 UM0079Homo 



428479 Y00272 
400300 X03363 
433404 T32982 Hs.102720 
410079 U94362 Hs.58589 
401781 

447359 NIVL012093HS.18268 
402230 NA 

427674 NMJJ03528HS.2178 
428398 AI249368 Hs.98558 
458098 BE550224 Hs74170 
419968 X04430 Hs.93913 
416636 N32536 Hs.42645 
419703 AI793257 Hs.128151 
424687 J05070 Hs.151738 
449679 AI823951 Hs.129700 
421296 NMJ002666HS.103253 
442117 AW664964 Hs.128899 
400303 AA242758 Hs.79136 
419440 AB020689 Hs.90419 
444858 AJ199738 Hs.208275 
432239 X81334 Hs.2936 
440705 AA904244 Hs.153205 
400286 NA 

446466 H38026 Hs.308 
423201 NM_000163Hs,125180 
433043 W57554 Hs.125019 
439509 AF086332 Hs.58314 
425247 NM 005940HS.155324 
409757 NML001898Hs,123114 
425292 NMj305824Hs.155545 
448045 AJ297436 Hs.20166 
452681 AF153330 Hs.30246 
452243 AL355715 Hs.28555 
439310 AF086120 Hs.102793 



,TM > EPH_lbd 1 pkinase l SAM,fn3, 
SS 

l TM,Activinjecp,pkinase,death ( ZU5, 
.SS.TM.CD36 
.SS.mitojsrr 
JM.7trrL3ANF_receptor, 
SS.CoIlagen.COLFl.TSPN.SSJSPN 
SS SS 

,SS,RibosomaLS4e 
SS 

,SS,serpln, 
SS.TMjg.SS.TM 
,SS.kinesln. 
,SS.ig i Sema > pkinase, 
.SS.TM.fn3. 

fiiament,SS,tHament,filament 
,SS,TM ( OAGKa,DAGKc, 
.SS.pkinase.pkinase 



ESTs 

ESTs; calsyntenin-2 

ESTs 

ESTs 

ESTs, Moderately similar to AIU5_HUMAN A 
ESTs, Weakly similar to S64054 hypotheti 
TaigetExon 

collagen, type XI, alpha 1 
K1AA0575 gene product 
transcription factor AP-2 gamma (activat 
stannlocalcin 2 

serine (or cysteine) proteinase inhibHo 
CD83 antigen (activated B lymphocytes, i 
RAB6 interacting, kinesin-fike (rabkines 
ESTs 

prolactin receptor 
DKFZP434G032 protein 
ESTs 

cell division cycle 2, G1 to S and G2 to 



.ss.pwnase.pwnase «;« umawu yjw> ^ « w w « 

,SS.TM 1 pkinase ) Recep^L.domain.SH2,PH^urHER2 receptor tyrosine kinase {c-erb-b2, 



SS 

GiycoJransfJ.SS 

.SS.filamentPribosyltran.filament.Armad 

SS.adenylatekinase, 

.SS.TM.p450, 

hlstone,SS,histone, 

.SS.TM 

SS 

SS.IL6.IL6. 
.SS.TM 
,SS,zf-C2H2, 



ESTs 
glycogenln 2 
TaigetExon 
adenylate kinase 5 
Target Exon 

H2B histone family, member Q 
ESTs 

metallothionein 1E (functional) 
interieukin 6 (interferon, beta 2) 
solute carrier family 16 (monocarboxylic 
ESTs 



SS.Pepfjdase M10,fn2 < hemopexin < SS l TM.Pepmatrix melafloproteinase 9 (gelatinase B 
SS tofloid-like 1 

SS perflipin 
.SS.TM ESTs 
SS.TM UV-1 protein, estrogen regulated 

Ss' KIAA0882 protein 

ss ESTs, Weakly similar to ALUA^HUMAN llll 

SS,Pepfidase_M10 ) hemopexin,SSJ 3 eptidasejnatrix metalloproteinase 13 (coilagenase 

C16000922:gi|7499103|pir||T20903 hypothe 
arrestin 3, retinal (X-arrestin) 
growth hormone receptor 
lymphoid nuclear protein (LAF-4) mRNA 
ESTs 

matrix metalloproteinase 11 (MMP1 1; stro 
cystatin SN 

37 kDa teucine-rich repeat (LRR) protein 
prostate stem cell antigen 
solute carrier family 19 (thiamine trans 
programmed cell death 9 
ESTs 



TM 

SSJM.ABCJran.ABCjnembrane.SS 
arrestin,SS 
SS.TM.fh3.SS 
SS 

.SS.TM.Syntaxin 

SS.Peptidase_M10.hemopexin.SS 
.SS.cystatin, 
SS 

,SS,TM 
.SS.TM 
SS 

.SS.TM.UDPGT/aseinJcappa 



7.1 
7.1 
7,1 
7.1 
7.0 
7.0 
7.0 
7.0 
6.9 
6.9 
6.8 
6.8 
6.7 



6.7 

6.7 

6.6 

6.6 

6.6 

6.6 

6.6 

6.6 

6.5 

6.4 

6.2 

6.2 

6.2 

6.2 

6.2 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.0 

6.0 

6.0 

6.0 

5.8 

5.8 

5.8 

5.8 

5.7 

5.7 

5.7 

57 

5.6 

5.6 

5.6 

5.5 

5.5 

5.5 

5.4 

5.4 

5.4 

5.4 

5.4 

5.3 

5.3 

5.2 
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441111 

452355 
427711 
418636 
429353 
441690 
430447 
429698 
425325 

423600 
414737 
403593 
407758 
445234 
411165 
420633 
414117 
416783 
401093 
411096 
457411 
436007 
450506 
417975 
421072 
427032 
447752 
403199 
427122 
445900 
413048 
419563 
442432 
452093 
442323 
450606 
435542 
417576 
446089 
445413 
424420 
432378 
452190 
434674 
419966 
421582 
410361 
426327 
406639 
452834 
427315 
446733 
442118 
421524 
453060 
453403 
444301 
453619 
432656 
426384 
431701 
416931 
420854 
418867 
443514 
447499 



AI805867 Hs.126594 
N54926 Hs.29202 
M31659 Hs.180408 
AW749855 

AL1 17406 Hs.200102 



R81733 
W17064 
AI685086 
X52730 



Hs.33106 
Hs.332848 
Hs.26339 
Hs.1892 



.SS.TM.Phosphodiest, 


ESTs 


5.2 


TMJtmJJM 


G protein-coupled receptor 34 


5.2 


SS 


solute carrier family 25 (mitochondrial 


5.2 


,SS.TM,HECT 


gb:QV4-BT0534-281299-053^c05 BT0534 Homo 


5.2 


.SSJM.ABCJraaABCjrierribrane, 


ATP-binding cassette transporter MRP8 


5.1 


,SS,HECT,zf-UBR1,PABP.140-3, 


ESTs 


5.1 


SS 


SWI/SNF related, matrix associated, acti 


5.1 


,SS.ras, 


ESTs, WeaWy similar to S21348 probable 


5.1 


SS,NNMT_PNMT__TEMT,SS ,NNMT JWMTJEMT.STAR 





phenyiethanolamine N-melhyltransferase 



Hs.310359 
Hs.125087 



A1633559 
A1160386 
HA 

D50915 Hs.38365 
AW137636 Hs.146059 
NAI.000169HS.69089 
NM_014581Ks^74480 
W88559 Hs.1787 
AA206186 Hs.79889 

U80034 Hs.68583 
AW085961 Hs.130093 
AI247716 Hs.232168 
NM_004460Hs.418 
AA641836 Hs.30085 
A1215069 Hs.69113 
AF012023 Hs.173274 
M73700 Hs.105938 
HA 

AW057736 Hs.323910 
AF070526 Hs.13429 
M93221 Hs.75182 
AA526235 Hs.193162 
8E093589 Hs.38178 
AA447453 Hs.27860 
AW016669 Hs.29190 
AI668605 Hs.60380 
AA687376 Hs.269533 
AA339449 Hs.82285 
AI860021 Hs.270651 
AA151342 Hs.12677 
BE614743 Hs.14668B 
AI493046 Hs.146133 
H26735 Hs.91668 
AA831879 Hs.136985 
AJ345455 HsJ8915 
A1910275 Hs.1406 
BE391804 Hs.62661 
W03242 Hs.44898 
M97711 

AI638627 Hs.105685 
AA179949 Hs.175563 
AA863360 Hs.26040 
AA976718 Hs.202242 
AA312082 Hs.105445 
AW294092 Hs.21594 
BE466639 Hs.61779 
AK000136 Hs.10760 
H87648 Hs.33922 
NM_000246Hs.3076 
AI472078 Hs.303662 
AW935490 Hs.14658 
045371 Hs.80485 
AW296927 
D31771 Hs.89404 
BE464288 Hs.141937 
AW262580 Hs.147674 



SS 
SS 
.CiOE-N.i 
SS.SS 
,SS,TM 



5.1 
ESTs 
ESTs 

Target Exon 

K1AA01 25 gene product 

ESTs 



SS,Atelibiase,BTKPH i pkinase,SH2.SH3^ibo galactosidase, alpha 
SS 1 fipocaIin f SS,Bpocalin odoranttilnding protein 2A 

,TM f ion_trans,K_tetra, proteollpid protein 1 (Pelizaeus-Merzbac 

SS.TM.TM monocyte to macrophage differentlation-a 

TM.LRRCT.TM.LRRCT, C12000586^gi|6330167|dbj|BAA86477.1| (A 

Peptidase M3. mitocrwndrial intermediate peptidase 

SS ESTs 
.SS.adrLzinc, ESTs 
SS.DPPlV^NJermPeptidase^Sg.SS.OPPIV.N.fibroblast activation protein, alpha 



.SS.trypsin 
SS 

,SS,14-3-3 

SS,transfemn,7trrLl .transferrin, 
SS,TM,FoIate_camer,SS,TM 1 Folate_came 



hypothetical protein FU23186 
ESTs 

integrin cytoplasmic domain-associated p 
lactotransfemn 

NNL025243*:Homo sapiens solute carrier f 



,SSJM^kirase,RecepJ^domain,SH2,PH,FurHER2 receptor tyrosine kinase (c-erb-b2, 



Homo sapiens clone 24787 mRNA sequence 
mannose receptor, C type 1 
Homo sapiens cONA FU1 1983 fis, done HE 
hypothetical protein FU23468 
Homo sapiens mRNA; cDNA DKFZp586M0723 (f 
ESTs 

ESTs, Moderately similar to ALU6_HUMAN A 
ESTs 



,SS,Ca channel B, 

SS,TM,lectin_c,fn2,Ricin B tectin.SS.TM 
SS 
SS 

,SS,TM,7tm_1, 
,$$,TM,CB$,voltageJXC 
,TM,GJyco_hydroJ 
,SS,pkinase,RhoGEF,lg,PH,SH3, 
AIRS,formyUransf 1 GARS,SS,GARS f AlRS,for phosphoribosylglycinamide formyttransfer 
.pkinase ESTs, Moderately similar to A47582 B-cel 

SS,UPF0099,SS,UPF0099, CGU47 protein 

.SSJM.MAPEG. prostaglandin E synthase 

.SS.TM.UDPGT ESTs 

,SS,TM,PH,SH2,Furin-like 1 pkinase,RecepJ- Homo sapiens clone PP1498 unknown mRNA 
,SS ( Histdeacetyl, ESTs 

pkinase,OPR, GA-binding protein transcription factor, 

SS.trefoa.SSJMJdLrecepta.SRCR.tryps . trefoil factor 1 (pS2) 
SSJM.GBP.TM.GBP 
SS 

SS.SS,lg ( 

,SS,DEAD.Fbn\Jiead 



,SS,TM,p450, 

.ig.Sema, 

SS 

,SS,ras 1 Y_phosphatase,ras 
.SS.HMGJwxJament, 

ss,LRass 

SS 

SSXXR, 

,SS,ArfGap« 

.SS.BIR 

SS.CIq.ColJagen.SS.CIq. 
l SS,TM,Peptidase_M1, 
SS.homeobox.homeobox« 
.SS.TM,MiP. 

.SS,TM t cadherin,cadherin 



guanylate binding protein 1, interferon- 
Homo sapiens ctone TCCCTA00151 mRNA sequ 
gb:Human T-cell receptor (V beta 18.1, J 
K1AA1688 protein 

Homo sapiens mRNA; cONA OKF2p564N0763 (f 
ESTs, Weakly similar to fatty acid omega 
ESTs 

GDNF family receptor atpha 1 
hypothetical protein MGC15754 
Homo sapiens cONA FU13591 fis, clone PL 
asporln(LRR class 1) 

Homo sapiens, ctone MG&9084, mRNA. comp 

MHCcfcssUtransadrvator 

ESTs 

Human chromosome 5q13.1 done 5G8 mRNA 
adipose most abundant gene transcript 1 
gbUl4«W0-ajoc-07-04Jl.s1 NCI__CGAP_Su 
msh (Orosophaa) homeo box homotog 2 
ESTs 

protocadherin beta 16 



5.1 
5.1 
5.1 
5.0 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
4.8 
4.8 
4.8 
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4.7 
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4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
A3 
4.3 
4.3 
4.3 

4.3 
4.3 
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441560 F13386 HsJ888 
409064 AA062954 Hs.141883 
422667 H25642 Hs.133471 
454032 W31790 Hs.194293 
5 432663 A1984317 Hs.122589 
401747 

432882 NM_013257Hs.279696 
437036 AI571514 Hs.133022 
447754 AW073310 Hs.163533 

10 443194 A1954968 Hs.279009 
451871 AI821005 Hs.1 18599 
457211 AW972565 Hs.32399 
421566 NM„000399Hs.1395 
431657 AI345227 Hs.105448 

15 427899 AA829286 Hs.332053 
444779 A1192105 Hs.147170 
442295 AI827248 Hs.224398 
436396 A1683487 Hs.152213 
446039 AI150491 Hs.90756 

20 422938 NM.001809Hs.1594 
406922 S70284 
439285 AL133916 Hs.172572 
424800 AL035588 Hs.153203 
429922 Z97630 Hs.226117 

25 447178 AW594641 Hs.192417 
409038 T97490 Hs.50002 
452747 8H153855 Hs.61460 
420139 NM.005357HS.95351 
408877 AA479033 Hs.130315 

30 403329 NA 

439926 AW014875 Hs.137007 
430832 AI073913 Hs.100686 
432481 AVV451645 Hs.151504 
452410 AL133619 Hs.29383 

35 418661 NM_001949Hs.1189 
431958 X63629 Hs.2877 
425071 NMJ)13989Hs.154424 
447197 R36075 
428722 U76456 Hs.190787 

40 428330 L22524 Hs.2256 
423242 AL039402 Hs.125783 
449048 Z45051 Hs.22920 
414831 M31158 Hs.77439 
413589 AW452631 Hs.313803 

45 408875 NM_015434Hs.48604 
418629 BE247550 Hs.86859 
450787 AB006190 Hs.25475 
414870 N72264 Hs.300670 
450325 AI935962 Hs.26289 

50 407633 NM_007069Hs.37189 
426172 AA371307 Hs.125056 
442262 BE170651 Hs.8700 
427961 AW293165 Hs.143134 
445563 AW873606 Hs.149006 

55 403943 

408761 AA057264 Hs.238936 
• 423279 AW959861 Hs.290943 
420440 NM_002407Hs.97644 
445107 AI208121 Hs.147313 

60 428303 AW974476 Hs.183601 
411667 8E160198 
427809 M26380 Hs.180878 
418203 X54942 Hs.83758 
430376 AW292053 Hs.12532 

65 444190 AI878918 Hs.10526 
433495 AW373784 Hs.71 
429638 A1916662 Hs,211577 



.SS.CUB, 
,SS.TM,FMO-like 

,ss,™ 

TM 

,SS,filament,filament 
pkinase,pkinase_C, 
,SS,TM 
.pkinase, 
,SS,TM 
.SS.GDNF, 
WH1.WH1 
zf-C2H2,SS 
.SS.TM.pkinase 



|UU| I III^MIKHH ^ - — i J 

,SS,SA^roteins,ABC jnembrane,ABC Jran,aerum amyloid A1 



Homo sapiens clone 23736 mRNA sequence 

ESTs 

ESTs 

ESTs, Weakfy similar to (54374 gene NF2 
ESTs 

Homo sapiens keratin 17 (KRT17) 
semm/glucocorticoid regulated kinase-li 
ESTs 

Homo sapiens cDNA FU14142 lis, done MA 

matrix Gla protein 

ESTs 

ESTs, Weakly similar to S51797 vasodilat 
early growth response 2 (Krox-20 (Drosop 
ESTs, Weakly similar to B34087 hypotheti 



ESTs 

Homo sapiens cDNA FU11469 fis, clone HE 
wingless-type MMTV integration site faml 
ESTs 

centromere protein A (17kD) 
gbrstearoyt-CoA desaturase {human, adipo 
hypothetical protein FU20093 
MyoD family inhibitor 
H1 hlstone family, member 0 
ESTs 

small inducible cytokine subfamily A (Cy 
lg superfamBy receptor LNIR 
lipase, hormone-sensitive 
ESTs, Weakiy similar to A47582 B-cetl gr 
Target Exon 
ESTs 

ESTs, Weakly similar to JE0350 Anterior 
Homo sapiens cDNA FU11973 fis, clone HE 
Homo sapiens mRNA; cDNA OKFZp434E2321 (f 
00 E2F transcription factor 3 

SSJM,Cadherin^C_term,cadherin,SS,TM t cadcadherin3, type 1, P-cadhenn (placenta 
SS.T4 delodinase,T4 deiodinase, deiodinase, iodothyronine, type II 

,TM,SDF, gb:yh88b01.s1 Soares placenta Nb2HP Homo 

.SS.TIMP, tissue inhibitor of metalloproteinase 4 

SS.PepCdaseJV)10,SS,Peptidase_M10 J hernopematrix metalloproteinase 7 (MMP7; uterin 
ss' DEME-6 protein 

SS.SS.TM similar to S68401 (cattle) ghicose indue 

,SS,cNMP binoTng t Rila,HMG box protein kinase, cAMP^dependent, regulato 

SS abhydrolase ESTs, Highly similar to AF1 57833 1 noncl 

'ss DKFZP434B168 protein 

SS,SH2,PH,SS,TM.PH,SH2,Furin-like,pklnas growth factor receptor-tound protein 7 



SS 

,COLFI,vwc,Collagen, 
,wnt 

,TM,GlycoJiydroJ 

.SSJM.thiolase, 

SS,TM,Desaturase,SS 

,SS,ig,pklnase,LRRNT ( LRRCT 

HLH.SS 

,SS,TM,linker_histone,7trnJ 

.SSJM 

SS.IL8,SS,I18 

,SS,HLH 

.ssjm^so, 

,SS,TM 
SS.SS 
SS 
SS 

,SS,Co!!agen,COLF! 1 TSPN, 
,SS,TM.ras 



SS,TM 1 MIP,SS,TM,MIP 
SS 
SS 

TM.TM 
,SS,DENN 
.SS.START, 
SS 

.SS.WH1.WH1 
p450,SS,p450 
.SSJMJtmJ, 
SS 

,SS,SRCR,Uteroglobin 
,SS,TM 

SS,RGS,RGS,RGS 
TM 

,SS t lipase.PLAT,Sec7,PH. 

CKS.SS.CKS, 

SS 

SS 

SS,lg,MHC l ( connexin,SCAN,SS ( TM 
SSJM.SS 



aquaporln7 
KIAA1204 protein 
ESTs 

similar to rat HREV107 
ESTs 

deleted in liver cancer 1 

ESTs 

ESTs 

C5000355:gi|4503225|reflNP_000765.1 1 cyt 
ESTs, Weakly similar to (deffine not ava 
ESTs 

mammagiobln 2 (MGB2; mammaglobin B; lip 
ESTs, Weakiy similarto 138022 hypotheti 
regulator of G-protein signalling 16 
gb:QV1-HT0413-010200-059-h03 HT0413 Homo 
lipoprotein lipase 
CDC28 protein kinase 2 
chromosome 1 open reading frame 21 
cysteine and glycine-rich protein 2 
alpha-2-glycoprotein 1, zinc 
kinectin 1 (kinesin receptor) 



4.3 
4.3 
4.3 
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454071 A1041793 Hs.42502 
451859 H44491 Hs.252938 
420281 A1623693 Hs.191533 
427691 AW194426 Hs.20726 
428824 W23624 Hs.173059 
424676 Y08565 Hs.151678 
418026 8E379727 Hs.83213 
457465 AW301344 Hs.122908 
417601 NM.014735HS.82292 
10 407999 AI126271 Hs.49433 
AA890023 Hs.1906 
AU076643 Hs.313 
AA676939 Hs.69285 
. v ™ Y13647 Hs.119597 
15 417511 AL049176 Hs.82223 
AW207175 Hs.106771 
T97307 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



425548 
446619 
411213 
406625 
417511 
428769 
407137 
401866 
451195 
426044 
426310 
440029 
408573 
431830 
444781 
431493 
414175 
411789 
418851 
453968 
407104 
449051 
434398 
454042 
459496 
414998 
407756 
442101 
449722 
452554 
421991 
420058 
425776 
407846 



445873 
418054 
414921 
452268 
428862 
412520 
410530 
401780 
447131 
418334 
415138 
421168 
431473 
421379 
411984 
408101 
405366 
414612 
411393 
435767 
416406 
433068 



U10492 Hs.438 
AA502490 Hs.336695 
NNL000909HS.169266 
AW089705 Hs.293711 
AA284775 Hs.43148 
Y16645 Hs.271387 
NM_014400Hs.11950 
AI791493 Hs.129873 
A1308876 Hs.103849 
AF245505 Hs.72157 
AI417828 Hs.192435 
AA847843 Hs.62711 
S57296 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA808940 Hs.274450 
NM_002543Hs77729 
AA1 16021 Hs.38260 
AI651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs.58006 
NM.014918HS.1 10488 
AK001423 Hs.94694 
U25128 Hs.159499 
AA426202 Hs.40403 
L34041 Hs.9739 
AA250970 Hs.251946 
NML002318HS.83354 
BE390551 Hs.77628 
NM003512HS.28777 
NM.000346Hs.2316 
AA442324 Hs.795 
M25609 Hs.64173 

NNL004585HS.17466 
AA319233 Hs.5521 
C18356 Hs.295944 
AF182277 Hs.330780 
AA825686 Hs.321176 
Y15221 Hs.103982 
NMJH)5419Hs.72988 
AW968504 Hs.123073 

BE274552 Hs.76578 
AW797437 Hs.69771 
H73505 Hs.117874 
D86961 Hs.79299 
NWL006456HS.288215 



,TM,7tmJ ( 

.SS.TM.EGF.ldLrecepLaJdLrecepL.b.EGF 
.SSMA, 

,SS < G}ycos_transf_2, 
SS 

Glycos,transL2,Ricin_B_tectin,SS,Glycos 

lipocaGn.SS.Iipocarm.lipocafin.feniti 

.SS.Pribosyltran.Sutfatase 

PHD.pklnase,SS 

SS 

SS.TM,fn3,SS.TM l fn3 l 
,SS,TM,ethand.ion_trans 



SSJM,CUB,F5_F8_type_C.MAM,SS,TM.CUB l F5_neurDpiIin1 



ESTs 

ESTs. Weakly similar to ALU INHUMAN ALU S 

ESTs 

ESTs 

ESTs 

UDP-N-acetyl-a!pha-D-ga!adosamine;potyp 

fatty acid binding protein 4, adipocyte 

DNA replication factor 

WAA0215 gene product 

ESTs, Wealdy similar to YZ28_HUMAN HYPOT 

prolactin receptor 

secreted phosphoprotein 1 (osteopontin, 



stearoyi-CoA desaturase (delta-9-desatur 
chord in-like 
ESTs 

gb:ye53hQ5.s1 Soares fetal liver spleen 
Target Exon 

mesenchyme homeo box 1 
ESTs 

neuropeptide Y receptor Y1 
ESTs, Weakly simitar to S64329 probable 
ESTs 

small inducible cytokine subfamily A (Cy 
GPi-anchored metastasis-associated prote 
ESTs, Wealdy similar to A36036 cytochrom 
,THhemopexin,Peptidase - M10 ( hemopexin,Pehypothetical protein DKFZp761D1 12 
ig.LRRCT, DKFZP564H922 protein 

.SS.TM ESTs 

.SS.HMG _box, Homo sapiens, done IMAGE:3351295, mRNA 

,SS,TM 1 SH2 1 PH,pkinase,Recep - .L.domain,Furv-erb-b2 avian erythroblasfic leukemia v 



SS.TM.Desaturase.SS 
SS 

.SS,7tm 1.SPRY. 

,SS,TM,GDA1_CD39 

.SS.filament, 

SS.homeobox.Ets.SS.homeobox, 
SS 

SS,TM,7tmJ, 
SS 

,SS,TM,PMP22^Ciaudin l 

.SS.TMJL8 

.SS,PH,lactamase_B 

.ss.p450.p450 



HER2 receptor tyrosine kinase (c-erb-b2, 
serunvinducibie kinase 
hypothetical protein FU20093 
EST 

oxidised low density lipoprotein (lectin 
ubiquilin specific protease 18 
ESTs 
cycfin B1 

ESTs, Weakly similar to ALUS JiUMAN ALU S 
WAA0990 protein 

Homo sapiens cONA FU 10561 fis, done NT 
parathyroid hormone receptor 2 



SS 

pktoase,POLO_box t SS,pkinase,POLOJ>ox. 
.SS.ig.pWnase.LRRNT.LRRCT, 
,SSJHKRAB,SCAN,zf-C2H2,ig 
.SS.TM 

SS,UCH-1,UCH-2,SSJM f G_glu_transpept 
SS 

cyclin.SS.TM.cydin, 
SS 
SS 
SS 

SS,TM,7tnu2,SS,TM,7trru2 

JM,ABC_membrane,ABCJran,RirwsomaLS4e^ transactivator, wit 

,SS,TM,transport^rot,SWiB,RhoGAP,DAG_PEglycerol^hosphate dehydrogenase 1 (so 
.SS.rmi.PABP.pkinase.^-S.rrm poly{A}-binding protein, cytoplasmic 1-1 

.SS.TM.mitojarr.LysyLoxidase- lysyi oxidase-ltke 2 

SS.START.SS.START.NNMT PNMT TEMT, steroidogenic acute regulatory protein r 
SS.histone,Calc - CGRP_IAPP l ig ) MHCJ,SPRY,H2A histone family, member L 
SS.HMG.box, 
histone.SS.histone.BolA 
ATP-synLab ( SS,7tm_1 .ATP-synLab 
fdament,SS,filament,filament 
TM 

,SS,TM,RibosomaLL27e, 
,Kunlfc_8PTI, 
SS.p450.SS 
SS 

SS.IL8, 

SH2.STAT.SS.STAT 



RhoGEF,PH,SS,RhoGEF,PH, 

SAP.SS.FG-GAP.vwa 

SS.sushi.trypsln.vwa.rrm.fibrinogeiuC.fn 

,SS^eptidase_S8,P,PeptJdase_S8,P 

,SS,TM 

.SS.Pribosyllran, 
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SRY (sex deterrninfrig region Y)-box 9 (ca 
H2A histone family, member O 
ATPase, H transporting, lysosomal (vacuo 
NM_005557*:Homo sapiens keratin 16 (foca 
relinolc add receptor res ponder (tazaro 
ESTs 

tissue factor pathway inhibitor 2 
cytochrome P450, subfamfly IJB (phenobar 
ESTs, Weakly simitar to S65824 reverse t 
small inducible cytokine subfamily B (Cy 
signal transducer and activator of trans 
COC2-related protein kinase 7 
NMJ)0337r:Homo sapiens vav 2 oncogene ( 
protein Inhibitor of activated STAT3 
B-fector, properdin 
ESTs 

lipoma HMGIC fusion partner-like 2 
slaiyltransferase 
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445462 AA378776 Hs.28B649 SS.SS hypothetical protein MGC3077 3.3 

439452 AA918317 Hs.57987 SS.SS B-cell CLUlymphoma 11B {zinc finger pro 3.3 

452017 AF109302 Hs.27495 SS prostate cancer associated protein 7 3.3 

409099 AK000725 Hs.50579 SS hypothetical protein FU2071 8 3.3 

5 452106 AI141031 Hs.21342 SS ESTs 3.3 

447519 U46258 Hs.339665 SS ESTs 3.3 

426928 AF037062 Hs.172914 .SS.adh.shortTGF-beta.TGFbjpropepfide retinol dehydrogenase 5 (11-cis and 9*c! 3.3 

438825 BE327427 Hs.79953 ,SS 1 TM t histoneANF^receptor.guanyiate„cy ESTs 3.3 

414575 H11257 Hs.22968 ,SS,pkinase,ig, Homo sapiens done IMAGE.451939, mRNA se 3.3 

10 417837 AL079905 Hs.1 103 SS,TGFb_propeptide,TGF^eta,SS transforming growth factor, beta 1 3.3 

422128 AW881145 SS gb:QV0-OTD033^)10400-182-a07OT0033Homo 3.3 

445941 AI267371 Hs.172636 SS.SSJipoxygenasePLAT ESTs 3.3 

429973 AI423317 Hs.164680 ,SS,T-box,UDPGT ESTs 3.3 

444542 AI161293 Hs.280380 SS.SS.Peptidase_M1^GF.fg,iectin_c,sushi aminopeptidase 3.3 

15 459561 Ai54730$ Hs.134981 SS ESTs 3.3 

425741 AF052152 Hs.159412 ( pkinase, Homo sapiens clone 24628 mRNA sequence 3.3 

426501 AW043782 Hs.293616 SS ESTs 3.3 

456508 AA502764 Hs.123469 SS ESTs, Weakly similar to AF208855 1 BM-01 3.3 

434228 242047 Hs.283978 .SS,TM,7tn\J Homo sapiens PR02751 mRNA, complete cds 3.3 

20 415752 BE314524 Hs.78776 TM putative transmembrane protein 3.3 

400419 AF084545 ,SS,Peptidase_M1, Target 3.3 

439750 AL359053 Hs.57664 . ,TM 1 integrirLB,Ricin_8Jectln,rrm Homo sapiens mRNA full length insert cDN 3.3 

423858 AL137326 Hs.133483 ,SS,TM Homo sapiens mRNA; cDNA DKFZp434B0650 (f 3.3 

428514 AW236861 Hs.193139 ,SS,START,NNMTJ'NMT_TEMT, ESTs 3.3 

25 428698 AA852773 Hs.334838 SS KIM1 866 protein 3.3 

448988 Y09763 Hs,22785 SS.TM.TM gamma-aminobutyric acid (GABA) A recepto 3.3 

432072 N62937 Hs.269109 .Sema.ig. ESTs 3.3 

417433 BE270266 Hs.82128 SS,TM,LRRCT,LRRNT i LRR 1 TM,LRRCT, 5T4 oncofetal trophobiast glycoprotein 3.3 

452194 AJ694413 Hs,332649 ,SS,TM7tmJ>W_receptor,sush/ olfactory receptor, family 2, subfemify 3.2 

30 444051 N48373 Hs.10247 ,SS,ig, activated leucocyte cell adhesion molecu 3.2 

420042 AW015140 Hs.161723 .SS.CUB. ESTs 3.2 

457292 AI921270 Hs.334882 SS,TM,SS,TM,G-patch hypothetical protein FLJ14251 3.2 

421458 NM_003554Hs.1 04576 SS carbohydrate (keratan sulfete Gai-6) su! 3.2 

431104 AW970859 Hs.313503 .Semajg, ESTs 3.2 

35 443767 BE562136 Hs.9736 1 SS,PCI.RasGEF,hormonejec 1 zf-C4 ( proteasome (prosome, macropain) 26S subu 32 

419589 AW973708 Hs.201925 ,FGF, Homo sapiens cDNA FU13446 fis. clone PL 3.2 

415447 Z97171 Hs.78454 SS.OLF,OLF,OLF,RibosomaLL4 myodin. trabecular meshwork inducible 3.2 

443464 BE548446 Hs.5167 SS.TM.SSF.SS.TM Homo sapiens mRNA; cDNA DKFZp434F152 (fr 3.2 

423431 AA326062 ,SS,p450,p450 gb:EST29 171 Cerebellum II Homo sapiens c 3.2 

40 413278 BE563085 Hs.833 .SSJM.ubiquilin.laniiniruG.laminirLEGF.k interferon-stimulated protein, 15 kDa 3.2 

458451 AW297181 Hs,195922 ,SS,RibosomaLL14 ESTs 3.2 

440449 AA885430 Hs.201925 ,FGF, Homo sapiens cDNA FLJ13446 (is, clone PL 3,2 

413753 U17760 Hs.75517 SS,laminin_EGF,laminin_Nterm,adh,short,S laminin, beta 3 (nicein (125kD), kalinln 3.2 

434676 AF160477 Hs.61460 ,SS,HLH |g superfamiiy receptor LNIR 3.2 

45 435575 AF213457 Hs.44234 SS.Ig.SSJM triggering receptor expressed on myeloid 3.2 

415773 R21651 Hs.324725 ,SS,TW I R2)osomaLS3Ae 1 G-gamma ESTs, Moderately similar to A47582 B-cel 3.2 

446440 AV658411 Hs.42656 SS K1AA1681 protein 3.2 

450847 NMJ03155HS.25590 .SS.homeobox, stanniocalcin 1 3.2 

426075 AW513691 Hs.270149 ,SS,fn3, ESTs, Weakly similar to 21 09260AB cell 3.2 

50 452110 T47667 Hs.28005 ,SSJMAc6vin_recp,pklnase Homo sapiens cDNA FU1 1 309 fis, done PL 3.2 

439963 AW247529 Hs.6793 ,TM.p450,Ets platelet-activating factor acetylhydrola 3.2 

402837 NA SS ENSP000002413 12*:DJ947L8. 1.8 (novel Sush 3.2 

439451 AF086270 Hs.278554 ,SS,Chramo_.shadow,chrorno, heterochromatin-like protein 1 3.1 

406664 L34041 Hs.9739 .SSJM.transpoitjjrot.SWIB.RhoGAP.DAG^^^ 3.1 

55 417315 A1080042 Hs.336901 ,SS,RNAj)oLA,RNAj>oU\2,Riboson^ 3.1 

413011 AW068115 Hs.821 SS,LRR,LRRNT I SS,LRRNT,LRR, biglycan 3.1 

414987 AA524394 Hs.294022 ,SS,connexin,hormone_rec,zf-C4,connexin hypothetical protein FLJ14950 3.1 

429197 H24471 Hs.26930 .SS.Gelsolin, ESTs, Weakly similar to T20272 hypotheti 3.1 

448030 N30714 Hs.325960 ,SS,TM membrane-spanning 4-domains, subfamily A 3.1 

60 407604 AW191962 Hs.249239 ,SS,TM.C1q, collagen, type VIII, alpha 2 3.1 

419092 J05581 Hs.89603 SS.TM.SEA, mucin 1, transmembrane 3.1 

456672 AK002016 Hs.1 14727 iSS.PK.PK Homo sapiens, clone MGC:16327, mRNA, com 3.1 

443171 BE281128 Hs.9030 SS,TM,7tmJ,rrm,SS TONDU 3.1 

452256 AK000933 Hs.28661 JM.GDI^tmJ, Homo sapiens cDNAFU 10071 fis, clone HE 3.1 

65 432201 AI538613 Hs.298241 SS,TM 1 trypsin ( SS,TM,trefoll,trypsin,tref Transmembrane protease, serine 3 3.1 

406642 AJ245210 SS gb:Homo sapiens mRNA for immunoglobulin 3.1 

400903 NA SS TargetExon 3.1 
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434408 AI031771 Hs.132586 ,SS,GIycoJiydro_2 ESTs 3.1 

452994 AW962597 Hs.31305 SS.WD40.SS.WD40, WAA 1547 protein 3.1 

445903 AI347487 Hs.132781 fn3,SS,TM,EF1BD class I cytokine receptor 3.1 

424364 AW383226 Hs.201 189 SS ESTs, Weakly similar to G0 1763 atrophin- 3.1 

5 410196 AI936442 Hs.59838 UBACTjepeatSS.UBACTjepeatThiFJa^^ 3.1 

419150 T29618 Hs.89640 .TM.pkinase.fnS, TEK tyrosine kinase, endothelial (venous 3.1 

433417 AA587773 Hs.8859 .SS.SRCR, Homo sapiens. Similar to RIKEN cDNA 5830 3.1 

418624 AI734080 Hs.104211 ( Sema,ig, ESTs 3.1 

436291 BE568452 Hs.5101 .SS.abhydrolase, protein regulator of cytokinesis 1 3.1 

10 411000 N40449 Hs.201619 SS ESTs, Weakly similar to S38383 SEB48 pro 3.1 

412519 AA196241 Hs.73980 SS,Troponin,Hemagglutinln,SS I TM,C2,Tropo troponin T1, skeletal, slow 3.1 

450223 AA418204 Hs.241493 .SS.proJsomerase, natural WBer-tumor recognition sequenc 3.1 

422790 AA809875 Hs.25933 JM,histone,Sec1Justone,sugar„tr ESTs 3.1 

424269 AW137691 Hs.199754 ,SS,TM ( 7tnL2,GPS ESTs 3.1 

15 435854 AJ278120 Hs.4996 .SS.WD40 pulafive ankyrirwepeat containing prote 3.1 

447388 AW630534 Hs.76277 ,SS,TM,nmoxidoredjj6,oxidored_.q6 Homo sapiens, clone MGC:9381, mRNA, comp 3.1 

451631 R00866 SS gb:ye79c02.s1 Soares fetal Over spleen 3.0 

448105 AW591433 Hs.298241 ,SS,TM,trefoil,trypsin.trefo0 Transmembrane protease, serine 3 3.0 

438637 BE500941 Hs.126730 JM.PH, ESTs, Weakly similar to WAA1 21 4 protein 3.0 

20 423024 AA593731 Hs.325823 ,SS,TM,CD36 ESTs. Moderately similar to ALUS JiUMAN A 3.0 

456592 R91600 ,SS,Ran_BP1.LIM,Ran_BP1.GRlP,TPR.LIM gb:yq1Qc02j1 Soares fetal fiver spleen 3.0 

425920 AL049977 Hs.162209 SS,mSS.TM,PMP2^.ClaudinPMP22_Claudindaudin 8 3.0 

444670 H58373 Hs.332938 ,SS,TM hypothetical protein MGC5370 3.0 

401197 arf,arf, ENSP00000229263*:HSPC213. 3.0 

25 437755 AW204256 Hs.291887 ,wnt ESTs 3.0 

452560 BE077084 Hs.336432 ,SS,rrm,zf-RanBP,pkinase.C2,pkinase_.C t OA ESTs 3.0 

410274 AA381807 Hs.61762 SS.SS hypoxia-inducible protein 2 3.0 

450098 W27249 Hs.8109 SS hypothetical protein RJ21 080 3.0 

404826 .SS.TM Target Exon 3.0 

30 458389 H70284 Hs.160152 .SS.RA ESTs, Weakly similar to FPHU alpha-fetop 3.0 

408196 AL034548 Hs.43627 HMGJox,pkinase^f-CCHC,SS.TM^MG_box,SRY (sex determining region YHwx 22 3.0 

433675 AW977653 Hs.75319 ,SS,ribonuc_red„sm, ribonucleotide reductase M2 polypeptide 3.0 

418848 A1820961 Hs.193465 .death.ZUS.pkiiiaseActivinjecp, ESTs 3.0 

422095 A1868872 Hs.282804 SS.Cu-oxidase.SS.Cu-oxidase.Cu-oxidase hypothetical protein RJ22704 3.0 

35 415992 C05837 Hs.145807 ,SSJM hypothetical protein FU 13593 3.0 

424631 AA688021 Hs.179808 SS ESTs 3.0 

409956 AW103364 Hs.727 SS.TGF-beta.TGFbjjropeptide.SS.TGF-beta.inhibin, beta A (activin A, actwin AB a 3.0 

419667 AU077005 Hs.92208 SS,disintegrin,Reprolysin,Pep„M12B_prope a disintegrin and metalloproteinase doma 3.0 

450946 AA374569 Hs.127698 SS ESTs, Moderately similar to 2109260A B c 3.0 

40 447770 AB032417 Hs.19545 Frizzled,Fz,SS,TMf rizzled,Fz. frizzled (Drosophila) homolog 4 3.0 

423826 U20325 Hs,1707 SS.SS cocaine- and anTphetarnine-regulated trans 3.0 

418838 AW385224 Hs.35198 ,SS.TM,Phosphodiest ectonucteotidepyrophosphatase/phosphodi 3.0 

442804 AW300118 Hs.131257 ,SS.TM.G-gamma ESTs 3.0 

432284 AA532807 Hs.105822 .SSJIApkinase, ESTs 3.0 
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TABLE 20A 

Table 20A shows the accession numbers for those pkeys lacking unigenelD's for Table 20. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey 


CAT number 


410785 


1221055J 


411667 


1253334J 


418636 


1774Q2J 


420854 


197072J 


422128 


211994J 


423431 


228162 1 


423945 


233566J 


426878 


273265 1 


433687 


373061J 


447 1S7 


711623_ 1 


451631 


878098J 


456207 


165078_-1 


456592 


202684J 



Accessions 

AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

BE160198 AW935898 T11520 AW935930 AW856073 AW861034 

AW749855 AA225995 AW750208 AW750206 

AW296927 AI684514 AI263166 AA281079 

AW881 145 AA49071 8 M85637 AA304575 T06067 AA331991 

AA326062 AA325758 AW962182 

AA410943 AW948953 AA334202 AA332882 

BE069341 AW748403 AL044891 AI908240 AA393080 

AA743991 M604852 AW272737 

R36075AI366546R36167 

R00866 R01523AI806815 

AA1 93450 

R91600 T87079AA291455 
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TABLE 20B 



Table 20B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



25 



30 



35 



40 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham 1. et aJ." refers to the publication 

entitled The DNA sequence of human chromosome 22/ Dunham L et aL, Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLpositon: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 

400608 9887666 Minus 96756-97558 

400903 2911732 Plus 59112-59228 

401045 8117619 Plus 90044-90184,91111-91345 

401093 8516137 Minus 22335-23166 

401197 9719705 Plus 176341-176452 

401747 9789672 Minus 118596-118816.119119-119244/I19609-11976U2M 
131932,132451-132575.133580-134011 

401780 7249190 Minus 28397-28617^920-290453135-29296,29411-29567^9705-29787,30224-30573 

401781 7249190 Minus 83215^3435,83531^3656,83740^83901,84237^84393,84955-85037,86290-86814 
401866 8018106 Plus 73126-73623 

402230 9966312 Minus 29782-29932 

402408 9796239 Minus 110326-110491 

402578 9884928 Plus 66350-66496 

402606 9909429 Minus 81747-82094 

402837 9369121 Minus 2013-2186.9570-9758,11136-11309.19429-196^ 

403199 9958183 Minus 58895-59036,66618-66789 

403329 8516120 Pius 96450-96598 

403593 6862650 Minus 62554-62712,69449-69602 

403943 7711864 Plus 100742-100904,101322-101503 

404091 7684554 Minus 82121-83229 

404347 9838195 Plus 74493-74829 

404826 6572184 Plus 47726-48046 

405366 2182280 Plus 22478-22632 

405654 4895155 Minus 53624-53759 
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TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS 
AMENABLE TO MODULATION BY SMALL MOLECULES 

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode either enzymes or proteins amenable to modulation by small 
molecules. These were selected as for Table 19, except that the ratio was greater than or equal 
to 3.0, the "average" normal adult tissue level was set to the 85 th percentile value amongst 
144 non-malignant tissues, and the 96 th percentile value amongst 73 breast cancers was 
greater than or equal 80 units, and the predicted protein contained a structural domain that is 
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase, 
peptidase, phosphatase, ATPase, or ionjransporter domains). The predicted protein domains 
are noted. 



Pkey. Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneJD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of 93rd percentile tumor to 85* percentile of normal body tissue 



Pkey ExAccn UnigeneJD Predicted Protein Domains 



449746 
400292 
424735 
4Q7UQ 
408045 
450375 
429170 
445730 
424634 
420757 
424399 
447350 
456207 
456938 
402578 
425692 
424001 
418007 
421727 
411869 
400289 
443348 
424086 
400295 
408771 
421155 
424905 
438167 
459583 
423945 
445263 



AI668594 Hs.176588 
AA250737 HsJ2472 
U31875 Hs.272499 
AA195651 Hs.104106 
AW138959 Hs.245123 
AA009647 Hs.8850 
NM_001394Hs.2359 
AI624342 Hs.170042 
NMJM3613HS.151407 
X78592 Hs.99915 
AI905687 Hs.2533 
AI375572 Hs.172634 
AA193450 

X52509 Hs.161640 

D90041 Hs.155956 
W67883 Hs.137476 
M13509 Hs.83169 
Y13153 Hs.107318 
W20027 Hs.23439 
X07820 Hs.2258 
AW873596 Hs.182278 
AI351010 Hs.102267 
W72838 Hs.2533 
AW732573 Hs.47584 
H87879 Hs.102267 
NNL002497HS.153704 
R28363 Hs.24286 
A1907673 
AA410943 

H57646 Hs.42586 



SS,p450 

death,ZU5 1 TM>ctivin_recp,pkinase, 
SS.TM 

SS.Dihydroorotase, 
Phosphodiest,Somatomedin_B, 

SS,TM,disintegrin,Pep_M12Bj)ropep,Reprol a dlsintegrin and metatloproteinase doma 



UnigeneTftle 

ESTs, Weakly slmilarto CP4Y_HUMAN CYTOC 
BMP-R1B 

short-chain alcohol dehydrogenase family 
ESTs 
ESTs 



DSPc,Rhodanese, 
SSJM.Catkxuefflux 
ig.tspJ.SS.AAA 

hormone rec^ndrogen_recep 1 zf-C4, 
SS 



SS,TM,p450,p450 



dual specificity phosphatase 4 
ESTs 

cartilage intermediate layer protein, nu 
androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gb2r40e07j1 Soares_NhHMPu«S1 Homosapi 



SSJM.aminotran 1 2,Cadherin C_term,cadhtyrosine aminotransferase 
SS.^SO.SS.TM^bij C1001134:gi|2117372lpir||l65981 fatty ac 

SS,Acetyltransf2, N-acetyltransferase 1 (arylamine N-acety 

pkinase, paternally expressed 10 

SS,hemopexin l Peptidase_M10,SS,Peptidase_matrix metalioproteinase 1 (MMP1; Inters 

kynurenine 3-monooxygenase (kynurenine 3 
SS.Peptidase_M1, ESTs 

tiemopexin,Peptidase_M10,SS,Peptidase_M10matnx metalioproteinase 10 (MMP10; str 
SS.DENN calmodulin 2 (phospnorylase kinase, delt 

SS.LysyLoxidase lysyl oxidase 

SS aldehyde dehydrogenase 9 family, member 

TM 1 K_tetra l ionJrans ( SS f TM,K_tetra,ionJ potassium voitage-gated channel, delayed 
SS.LysyLoxidaseAtdose_epim,Epimerase l S lysyl oxidase 
pkinase,SS,TM,pkinase 1 po!yprenyLsynt, NIMA (never in mitosis gene a)-re1ated k 
$S,TM,7tm_1 ,0450,1™ ESTs 

pkinase, gb:IL-BT152-080399-004 BT152 Homo sapien 

death,2U5,TM,Activin_recp,pkinase ( gb:zt32h03.r1 Soares ovary tumor NbHOT H 

KIAA1560 protein 
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R1 

65.7 
55.9 
53.8 
39.3 
34.9 
25.7 
24.9 
24.1 
21.7 
21.7 
20.3 
19.2 
18.3 
18.1 
17.8 
16.7 
16.5 
15.7 
15.3 
13.9 
13.5 
13.0 
12.8 
12.7 
11.9 
11.8 
11.7 
115 
11.5 
11.4 
11.2 
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15 



20 
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45 



50 



55 



60 



65 



423887 
429859 
428368 
418912 
451952 
402408 
415539 
406687 
419948 
400285 



409203 
424902 
431725 
418092 
427811 
423554 
426261 
413374 
432677 
456986 
453331 
407721 
418004 
410555 
443695 
423545 
401045 

442082 
418836 
404091 
450865 
424085 
449650 
432304 
415669 
442942 
428795 
428479 
400300 
447359 
402230 
424687 
432328 
432239 
400286 
425247 
434737 
439310 
441111 
452355 
427711 
429353 
425325 

448706 
403593 
432777 
446232 
411165 
414117 
411096 
450506 
417975 
447752 
427122 



AL080207 Hs.134585 
NM_007050Hs.225952 
BE440042 Hs.83326 
NM000685HS.89472 
AL120173 Hs.301663 
NA 

AI733881 Hs.72472 
M31126 Hs.272620 
AB041035 Hs.93847 
NA 

AF123050 Hs.44532 
AA780473 Hs.687 
NMJ>03866Hs.1 53687 
X65724 Hs.2839 
R45154 Hs.1066O4 
M81057 Hs.180884 
M90516 Hs.1674 
AW242243 Hs.168670 
NM_001O34HsJ5319 
NIvL004482Hs.278611 
D38299 Hs.170917 



SS.TM.BRCT.anMBCjran.ABCjran DKFZP434G232 protein 
SS.TM,Y_phosphatase,MAM,fn3, protein tyrosine phosphatase, receptor t 

SS,PeptklaseJd10,hemopexin ( SS r Peptidasejnauix metalioproteinase 3 (strometysin 
SS,TM,7tmJ,SSJM,7tmJ, angiotensin receptor 1 

$S,pJdnase, ESTs 

SS,carb_anhydrase NrvL030920*:Homo sapiens hypothetical pro 

deaft,ZU5,TM,Activin_recp l pkinase ( BNP-R1B 
SSPeptidase_M10,hemopexin,SS,Peptidase - rnatr1x metalioproteinase 11 (MMP1 1; stro 



Ferric_reduct,TM r Ferrlc_reduct, 
TMABCJran.ABCjnembrane, 
S$,TWubiquitm,7&Tu3,ANF_receptor r sushi 
SS,p45O,SS.p450 
SS,$S 

SS,Cys_knot,SS 
death,ZU5,pkinase^ctryin_recp, 



NMJ)16931:Homo sapiens NADPH oxidase 4 ( 
Eos Control 
diuMquifti 

cytochrome P450, subfamily IVB, polypept 
inositol polyphosphates-phosphatase, ty 
Norria disease (pseudogfoma) 
ESTs 



AI24Q665 
Y12735 
U37519 
U92649 



Hs.8895 
Hs.38018 
Hs.87539 
Hs.64311 



AW204099 Hs.337720 
AP000692 Hs.129781 



C11001883*:gi|6753278|ref|NPJ)33938.1lc 



SS^{arbOpeptPropep_M14,SSPror^p_M14cart)oxypeptidase B1 (tissue) 
GATa$e_2,SIS, glutamlne4roctose-6iJhosphatetransamin 
SSJM,Wr>«),ubiqui«n ( E1-e_ATPas8,Cab'onperoxisornal famesyiated protein 
SS ribonucleotide reductase M2 polypeptide 

SSJM,Gty(»s_transL2,Ridn_B - lectin 1 DPP UOP-N-acetyl-aIpha-r>galactosamine:poryp 
SS,TM,7tm_1 , prostaglandin E receptor 3 (subtype EP3) 

SS,TM,dislntegrin,Pep_M12B_propep^eprol ESTs 

pkinase, dual-specificity tyrosine-(Y}-phosphoryl 

SS,aldedh,SS,atdedh. aldehyde dehydrogenase 3 family, member 

TM,disintegrIn,Reprolysin f a dlslntegrin and metalioproteinase doma 

ESTs. Weakly similar to AF126780 1 retin 
GAF.PDEase chromosome 21 open reading frame 5 

ICEj)20 ) SS I ICE_p10,ICEj)20 l iCE_p10 l ICEj) 



R41823 Hs7413 
AJ655499 Hs.161712 
NA 

AI248013 Hs.106532 
NMJ)Q2914Hs.139226 
AF055575 Hs.23838 
AA932186 Hs.69297 
NM_005025Hs78589 
AW167087 Hs.131562 
R45503 Hs.97469 
Y00272 Hs.184572 
X03363 

NMJ)12093Hs.18268 
NA 

J05070 Hs.151738 
AI572739 Hs.195471 
X81334 Hs.2936 
NA 

NMJW594GHS.155324 
AA828246 Hs.291884 
AF086120 Hs.102793 
AI806867 Hs.126594 
N54926 Hs.29202 
M31659 Hs.180408 
AL1 17406 Hs.200102 
X52730 Hs.1692 
phenytethanolamlne N-methyltransferase 
AW291095 Hs.21814 SS.TM,pkinase, 
NA * 

AA564991 Hs.269477 
AI281848 Hs.194691 
NM 000169HS.69089 
W88559 Hs.1787 
UB0034 Hs.68583 
NM.004460Hs.418 
AA641836 Hs.30085 
^473700 Hs.105938 
AW057736 Hs.323910 



TM.EPHJbd.pkinase.SAM.m3, 
TMAcuVirLrecp,pkinase.deathZU5, 
TM,7trru3,ANF_receptor, 
zf-C2H2 

SSMA,ViraLhelicase1 .rrrn, 
TM.ion_trans.SS,TM,ion_trans, 
TM.7tmJ, 
SS.serpin. 

SSjg.Sema.pklnase, 
SS.TM 

SS.pkinase.pkinase 



67 

ESTs; calsyntenln-2 
ESTs 

Target Exon 

ESTs. Weakly similar to 138588 reverse t 
replication factor C (activator 1) 2 (40 
calcium channel, voltage-dependent L ty 
ESTs 

serine (or cysteine} proteinase inhibito 
ESTs 

ESTs. Highly similar to A39769 N-acetyil 
cell division cycle 2, G1 to S and G2 to 



SS,TM,pkinase,Recep_L_domain,SH2.PH,FuriHER2 receptor tyrosine kinase (oerb-b2. 
SS,adenylatekinase t adenylate kinase 5 

SS,TM,p450. Target Exon 

SSPeptia^se_M10,fn2,terrK)pexin.SS.TM,Pepmatrix metalioproteinase 9 (gelatinase B 
6PF2K.PGAM, 6fhosprK)frocto-2-kinase/fructose-2.6-bi 
SS.PeptidasejtflO.hemopexm.SS.Peptidasejnatrix metalioproteinase 13 (collagenase 
SS,TM.ABCJran,ABC_membrane.SS C1600(»22gil7499103lpirlIT20903hypothe 
SS.PeptidaseJvUO.hemopexin.SS matrix metalioproteinase 1 1 (MMP11; stro 

U(>i-1.pkmase,OPR,Rh(>dar^e^MP-bindlng,ESTs 
SS.TM.UDPGT.caseinJoppa ESTs 
SS.TM^hosphodiest, ESTs 
TM,7tm_1 ,TM G protein-coupled receptor 34 

SS solute carrier family 25 (mitochondria] 

SS.TM,ABCJran.ABCjnembrane, ATP-btndtng cassette transporter MRP8 

SS,NNMT_PNMT_TEMT,SS/INMTJPNMT_TEMr i STAR 

5.1 

interleukin 20 receptor, alpha 
CIDE-N.pWnase Target Exon 

alpha-amylase, ESTs 
SS.TM.7tm J,RibosomaLL13 retinolc acid induced 3 

SS,Melibiase,BTKPH,pkinase l SH2,SH3,Ribo galactosWase, alpha 
TMjon Jrans,KJetra. prateoBpH protein 1 (Pelizaeus-Merzbac 

Peptidase__M3, mitochondrial intermediate peptidase 

SS.DPPIV^NJerm.Peptidase_S9,SS.DPPIV_N.fibroblast activation protein, alpha 
SS.trypsin hypothetical protein FU23186 

SS,transferrin,7tm 1, transferrin, tactotransferrin 
SS,TM,pkinase^e(»pXdornain,SH2.PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
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6.9 

6.8 
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50 
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65 



400181 
452093 
435542 
417576 
446089 
424420 
452190 
419986 
421582 
446733 
453060 
400205 
420854 
432690 
441560 
416445 
439024 
432882 
447754 
453775 
431657 
427899 
422938 
418478 
406922 
439285 
429922 
420139 
425071 
424511 
428722 
428330 
414831 
413589 
418629 
413453 
403943 
444616 
408761 
427809 
418203 
454071 
424676 
457465 
417601 
446619 
406625 
428769 
426310 
417531 
444781 
431493 



414175 
455325 
429597 
425320 
431854 
407104 
449051 
434398 
454042 
407756 
401464 
412970 
412049 
425776 



NA 

AA447453 Hs.27860 
AA687376 Hs.269533 
AA339449 Hs.82285 
AI860021 Hs,270651 
BE614743 Hs.146688 
H26735 Hs.91668 
A1345455 HsJ8915 
AI910275 Hs.1406 
AA863360 Hs.26040 
AW294092 Hs.21594 
NA 

AW296927 

AF181490 Hs.278627 
F13386 Hs7888 
A1043004 Hs.79337 
R96696 Hs.35598 
NM_013257Hs.279696 
AW073310 Hs.163533 
NM 002916HS.35120 
AI345227 Hs.105448 
AA829286 Hs.332053 
NM_001809Hs.1594 
U38945 Hs.1174 
S70284 

AL133916 Hs.172572 
Z97630 Hs.226117 
NMJW5357HS.95351 
NMJ)13989Hs.154424 
BE300512 Hs.193557 
U76456 Hs.190787 
L22524 Hs.2256 
M31158 Hs.77439 
AW452631 Hs.313803 
BE247550 Hs.86859 
AA129640 Hs.128065 

AV653785 Hs.173334 
AA057264 Hs.238936 
M26380 Hs. 180878 
X54942 Hs.83758 
AJ041793 Hs.42502 
Y08565 Hs.151678 
AW301344 Hs.122908 
NM_014735Hs.82292 
AU076643 Hs.313 
Y13647 Hs.1 19597 
AW207175 Hs.106771 
NMJ00909HS.169266 
NMJ)03157Hs.1087 
NM 014400HS.11950 
AI791493 Hs.129873 
AF059214 Hs.194687 
A1308876 Hs.103849 
AW895719 
NM_003816Hs.2442 
U29344 Hs.83190 
AA383550 Hs.271699 
S57296 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA1 16021 Hs.38260 
AFQ39241 Hs.9028 
AB026436 Hs.177534 
N53437 Hs.18268 
U25128 Hs.159499 



SS,TM,3Beta HSD, ENSP00000171555:CDNA FU10727 lis, clone 

SS,TM,7tmJ, Homo sapiens mRNA; cONA DKFZp586M0723 (f 

SS,pkinase,RhoGEF,ig,PH.SH3, ESTs 

AlRS,formyLtransf,GARS,SS,GARS,AlRS ) for phosphoribosylglycmamide formyltransfer 
pldnase ESTs, Moderately similar to A47582 B-cel 

SS.TM.MAPEG, prostaglandin E synthase 

SS,TM > PH < SH2,Furin-llke,pkinase 1 Recep_L- Homo sapiens clone PP1498 unknown mRNA 
pkInase,OPR. GA-binding protein transcription factor, 

SS.trefon,SS 1 TM 1 WLrecepLa,SRCR 1 tryps trefoil factor 1 (pS2) 

ESTs, Weakly similar to fatty acid omega 
hypothetical protein MGC15754 
NMJ)06265*:Homo sapiens RAD21 (S. pombe) 
gb:UI-H-BW0-ajoc-07-0-Ul.s1 NCI_CGAP_Su 
prenylcystelne lyase 

Homo sapiens clone 23736 mRNA sequence 
K1AA0135 protein 
SS.TM.trypsin.vwd.ig ESTs 

pkinase.pWnase C, serum/glucocorticold regulated kinase-li 

pldnase, Homo sapiens cDNA FU14142 fis, clone MA 

SS,AAA,PI3 PI4 kInase,PI3Ka,PI3HLrbd 1 P13repHcation factor C (activator 1)4 (37 
SS,TM,pkinase ESTs, Weakly similar to B34087 hypotheti 

SS,SA/U>mtelns,ABCjTierrd>rane,ABCJran l serum amyloid A1 



SS,TM,p450. 
SS,ras,Y_phosphatase,ras 

SS.TM.PeptidaseJWI, 

SS 1 pyr_redox,SS,RibosomaLL39 

pkinase, 



SS,TM,thiolase, 
ank.ank 

SS^Desaturase.SS 
SS.lg.pklnase.LRRNURRCT, 
SS,TM,RnkerJistone ( 7trnJ 
SS,TM,p450, 

SS l T4_deiodinase,T4_deiodinase 1 
SS ( Y_phosphatase,Band_.41 
SS.TIMP, 



centromere protein A (17kD) 
cyclin-dependent kinase inhibitor 2A (me 
gb:stearoyl-CoA desaturase [human, adipo 
hypothetical protein FLJ20093 
H1 histone family, member 0 
lipase, hormone-sensitive 
deiodinase, lodothyronine, type I) 
ESTs, Moderately similar to ALU7JHUMAN A 
tissue inhibitor of metalloproteinase 4 



SSpeptWaseJfl10,SS,PeptWase_M10,hemopematrix metalloproteinase 7 (MMP7; uterin 
SS.cNMP bLndLng,R!!a,HMG _box protein kinase, cAMP-dependent regulato 

SS.abhydiolase ESTs, Highly similar to AF1 57833 1 noncl 

Ss!sH2,PH,SS,TM,PH.SH2Jurin4lke,pkinas growth factor receptortound protein 7 



SS,Peptidase_C1,gpdh 
p450,SS,p450 



SS,TM,7tmJ, 
$S,lipase,PLAT,Sec7,PH l 
CKS,SS,CKS. 
TM,7tmJ, 

GlycosJransU,R!cln_BJectin,SS,Glycos 
SS,Pribosy!tran,Sulfatase 
PHD.pkinase.SS 
SS.TM.efband.ionJrans 
SS,TM,Oesaturase,SS 
SS,7tmJ,SPRY, 
SS,TM,7tm_1, 

$S,pkinase,\rwa,vwa,GlycoJransL8 
SS,PH,lactamase.B 
SS,p450.p450 



ESTs 

C5000355:gi|4503225|ref|NPJ00765,1l cyt 
ELL-RELATED RNA POLYMERASE II, ELONGATIO 
ESTs, Weakly similar to (defline not ava 



COC28 protein kinase 2 
ESTs 

UDP-N-acetyklpha-D^a!actosamine:polyp 
DNA replication factor 
KIAA0215 gene product 
secreted phospboproteln 1 (osteopontin, 
stearoyl-CoA desaturase (deIta-9-desatur 
ESTs 

neuropeptide Y receptor Y1 
serine/threonine kinase 2 
GPl-anchored metastasis-associated prate 
ESTs, Weakly slmilarto A36036 cytochrom 
cholesterol 25-hydroxylase 
TM,hernopexin,Peptidase_M10,hernopexin,Pephypothetical protein DKFZp761D1 12 
TMjonJrans.KJetra, gb:GV4-NN0039-290300-154-f06 NN0039 Homo 

a dislntegrin and metalloproteinase doma 
AcyLtransUdrLZincketoacyl-syntpp^ fatty acid synthase 
IMS.SS polymerase (DNA directed) iota 

SS JM,SH2,PH,pldnase,Ftecep^domain,Fun>erb-b2 avian erythroblastic leukemia v 



SS 

pkinase,POLO_box,SS,pkinase,POLO_box, 

SS,ig,pkinase,LRRNT,LRRCT, 

SS.UCH-I.UCH^SSJM.G^transpept 

Peptidase_M24, 

DSPc.MyosinJail, 

SS,adenylatekinase, 

SS,TM,7tm_2,SS,TM,7tnL2 



HER2 receptor tyrosine kinase (c-erb-b2, 
serum-induclble kinase 
hypothetical protein FU20093 
ubiquitin specific protease 18 
histone deacetylase5 
dual specificity phosphatase 10 
adenylate kinase 5 
parathyroid hormone receptor 2 
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407846 AA426202 Hs.40403 


TM,ABC_membraneABC,tran.RlbosoniaLS4eCbp^00^teTacting transactivator, wit 


3.4 


406925 L34041 Hs.9739 


SS,TM,transportprotSWIB f RhoGAP,DAGJ ) E-glyceroW-phosphate dehydrogenase 1 (so 


3.4 


445873 AA250970 Hs.251946 


SS,rrrn,PABP,pkinase,14-3-3,rrm 


poly(A)-binding protein, cytoplasmic 1-1 


3.4 


418054 NM_002318Hs.83354 


SS,TM,mito carr.Lysyl oxidase 


fysyi oxidase-Bke 2 


.3.4 


406815 AA833930 Hs.288036 


SS.IPPT, 


tRNA isopentenyipyrophosphate transferas 


3.4 


410530 M25809 Hs.64173 


ATP-synLab,SS l 7trn_1 ,ATP-synt_ab 


ATPase, H transporting, lysosomal (vacuo 


3.4 


407021 U52077 




gb:Human marinerl transposase gene, comp 


3.4 


421168 AF182277 Hs.330780 


SS,p450,SS 


cytochrome P450, subfamily I1B (phenobar 


3.4 


431473 AA825686 Hs.321176 


SS 


ESTs. WeaWy sbnflarto S65824 reverse t 


3.4 


408101 AW968504 Hs.123073 


pkinase, 


CDC2-related protein kinase 7 


3.4 


422083 NMJ)01141Hs.111256 


lipoxygenase.PLAT, 


arachkJonate 15-iipoxygenase, second typ 


3.3 


411393 AW797437 Hs.69771 


SS.sushi, trypsin, vwa,rrm,fibrinogen_C,fn 


B-factor, properdin 


3.3 


435767 H73505 Hs.117874 


SS,Peptldase_S8,P,Peptidase_S8 ,P 


ESTs 


3.3 


433068 NM_006456Hs.288215 


SS,Pribosyltran, 


siaryhransferase 


3.3 


426928 AF037062 Hs.172914 


SS.adh_short,TGF-beta,TGFbj)ropeptide 


retincH dehydrogenase 5 (1 1-cis and 9-d 


3.3 


414575 H11257 Hs.22968 


SS.pkinase.lg, 


Homo sapiens clone IMAGE:451939, mRNA se 


3.3 


445941 AI267371 Hs.172636 


SS,SS,lipoxygenase,PUT 


ESTs 


3.3 


444542 AI161293 Hs.280380 


SS,SS,Peptidase J/l 1 ,EGF,rg,lec(iruc,susfii 


aminopeptida56 


3.3 


425741 AF052152 Hs.159412 


pkinase, 


Homo sapiens done 24628 mRNA sequence 


3.3 


434228 Z42047 Hs.283978 


SS.WtmJ 


Homo sapiens PR02751 mRNA, complete cds 


3.3 


433264 D85782 Hs.3229 




cysteine dioxygenase, type 1 


3.3 


400419 AF084545 


SS .Peptidase JA1, 


Target 


3.3 


439750 AL359053 Hs.57664 


TH!ntegrin3,Ricln„B_lectin p rirn 


Homo sapiens mRNA full length insert cDN 


3.3 


417757 R19897 Hs.106604 


death,ZlJ5,pkinase,Activin_recp, 


ESTs 


3.3 


452194 AI694413 Hs.332649 


SS.TrVtftm 3,ANF.jeceplor.sushi 


olfactory receptor, family 2, subfamily 


3.2 


421458 NIVL003654HS. 104576 


SS 


carbohydrate (keratan sulfate Gaf-6) su( 


3.2 


443767 BE562136 Hs.9736 


SS,PCl,RasGEF,rK)rrnone_rec ( zf-C4 l 


proteasome (prosome, macropain) 26S subu 


3.2 


422648 D86983 Hs. 118893 


peroxidase,LRRCT, 


Melanoma associated gene 


3.2 


423431 AA326062 


SS.p450.p450 


gb:EST29171 Cerebellum II Homo sapiens c 


3.2 


451264 AI768235 


SS.Trehalase 


gb:wg82g08.x1 Soares_NSF_F8_9W_OT_P/^P_S 


3.2 


452110 T47667 Hs.28005 


SS,TM,Activin_fecp,pkinase 


Homo sapiens cONA FU 1 1309 fo, clone PL 


3.2 


439963 AW247529 Hs.6793 


TM.p450.Ets 


platelet-activating factor acetylhydrola 


3.2 


453941 U39817 Hs.36820 


SS.DEAD,HRDC,helicase_C, 


Bloom syndrome 


3.1 


406664 L34041 Hs.9739 


SS.TM.transporU>rotSWIB t RhoGAP.DAG^PE- 


grycerol-3- 


phosphate dehydrogenase 1 (so 


3.1 






453487 R31770 Hs.23540 


TM,7tm_1. 


ESTs 


3.1 


420911 U77413 Hs.100293 




O-linked N-acetylglucosamine (GlcNAc) tr 


3.1 


443171 BE281128 Hs.9030 


SS.TM,7tmJ,rrm,SS 


TONDU 


3.1 


452256 AK000933 Hs.28661 


TM f GDI,7tra_1, 


Homo sapiens cDNA FLJ10071 fis, done HE 


3.1 


432201 AI538613 Hs.298241 


SS.TM.trypstn.SS.TM.trefoil.trypsin.tref 


Transmembrane protease, serine 3 


3.1 


419150 T29618 Hs.89640 


TM,pkinase,fn3, 


TEK tyrosine kinase, endothelial (venous 


3.1 


444443 AI149286 Hs.55099 


SS 


rab6 GTPase activating protein (GAP and 


3.1 


426283 NMJ)03937Hs.169l39 




kynureninase (L-kynurenine hydrolase) 


3.1 


436291 BE568452 Hs.5101 


SS,abhydroiase, 


protein regulator of cytokinesis 1 


3.1 


450223 AA418204 Hs.241493 


SS.proJsomerase, 


natural killer-tumor recognition sequenc 


3.1 


424269 AW137691 Hs.199754 


SSJM.7tmJJ.GPS 


ESTs 


3.1 


448105 AW591433 Hs.298241" 


SSJM.trefoil.uypsin.trefoil 


Transmembrane protease, serine 3 


3.0 


452560 BE077084 Hs.336432 


SS l rrrn.zf-RanBP.pkinase,C2,pklnase.C,OAGESTs 


3.0 
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TABLE 21A 



Table 21 A shows the accession numbers for those pkeys lacking unigeneDD's for Table 21. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



CAT number Gene cluster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accessions 

420854 197072 1 AW296927 AI684514 AI263168 AA281079 

20 423431 228162 1 AA326062AA325758AW962182 

423945 233566 1 AA410943 AW948953AA334202AA332882 

451264 863988.1 A1768235 R31400 H29082 H23107 

455325 1279475J AW895719 N31451 N41451 

456207 165078.-1 AA193450 



10 



Pkey: 



Unique Eos probeset identifier number 
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TABLE 21B 



10 



15 



20 



Table 21B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 21. *For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey: 
Ref. 

Strand: 
NLposition: 



Pkey Ref 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham i. et al.' refers to the publication 

entitled "The DMA sequence of human chromosome 22/ Dunham I. et al., Nature (1999} 402:489495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide posifions of predicted exons. 



401045 8117619 

402230 9966312 

402408 9796239 

402578 9884928 

403593 6862650 

403943 7711864 

404091 7684554 



Strand NLposition 

Plus 90044-90184,91111-91345 

Minus 29782-29932 

Minus 110326-110491 

Plus 66350-66496 

Minus 62554-6271 2.69449-69602 

Plus 100742-100904.101322-101503 

Minus 82121-83229 
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TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the 
denominator was the 85 th percentile value for 12 non-malignant breast specimens, and the 
96 th percentile value amongst the 73 breast cancers was greater than or equal 100 units. 



Pkey. Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiO: Unigene number 

15 Unigene Tide: Unigene gene title 

RV Ratio of 90* percentile tumor to 85 th percentile normal breast tissue 



Pkey ExAccn Unigene© UnigeneTitle 



R1 



400292 AA250737 Hs.72472 BMP-R1B 51.5 

424735 U31875 Hs.272499 shorichain alcohol dehydrogenase family 38.3 

400297 AI127076 Hs.334473 hypothetical protein DKFZp56401 278 29.9 

431448 AL137517 Hs.334473 hypothetical protein DKFZp56401 278 26.9 

25 451110 AI955040 Hs.265398 ESTs, Weak^sirnflaftotransformaflon-f 25.8 

43121 1 M86849 Hs.323733 gap junction protein, beta 2, 26kD {conn 23.2 

418203 X54942 Hs.83758 CDC28 protBin kinase 2 22.6 

407980 AA046309 gb:zff2i0».$1 SoaresJebUieafLNbHH19W 19.3 

414646 AA353776 Hs.901 C048 antigen (B-cell membrane protein) 18.9 

30 446921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0 

409041 AB033025 Hs.50081 KIAA1199 protein 17.6 

412140 AA219691 Hs.73625 RAB6 interacting, kinesln-like (rabkines 17.6 

407824 AA147884 Hs.9812 Homo sapiens cDNA FU14388 fis, done HE 17.1 

453160 AI263307 Hs.239884 H2B histone family, member L 17.0 

35 407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 16.1 

425692 D90041 Hs. 1 55956 N-acetyltransferase 1 (arylamlne N-acety 16.1 

438533 AI440266 Hs.170673 ESTs, Weakly similar to T24832 hypotheti 16.0 

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 15.5 

444342 NM_014398Hs.10887 simitar to lysosome-associated membrane 15.1 

40 422505 AL120862 Hs.124165 programmed cell death 9 (PXD9) 14.9 

430515 AA746503 Hs.283313 ESTs J4J 

417308 H60720 Hs.81892 KIAA0101 gene product M.4 
452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr14.4 

412446 AI768015 Hs.92127 ESTs J4.2 

45 415539 AI733881 Hs.72472 BMP-R1B 14.1 

435496 AW840171 Hs.265398 ESTs, Weafdy similar to transformation-r 13.8 

438209 AL120659 Hs.6111 a ryl-hydrocarbon receptor nuclear transl 13.8 
400205 NA NM_006265*:Homo sapiens RA021 (S. pombe)13.5 

430965 AA489732 Hs.154918 ESTs ]™ 

50 415263 AA948033 Hs.130853 ESTs «3.3 

451952 AL120173 Hs.301663 ESTs ]j* 

449722 BE280074 Hs.23960 cyclinBV . ™.2 

406685 M18728 gb:Human nonspecific crossreacting antig 13,0 

406690 M29540 Hs,220529 carcinoembryonic antigen-related ceil ad 128 

55 429925 NMJX)0786Hs.226213 cytochrome P450. 51 {lanosteroi 14*lpha 12.8 

416498 U33632 Hs.79351 potassium channel subfamily K, member 1 12.7 

432378 AI493046 Hs.146133 ESTs ]2.5 

441377 BE218239 Hs.202656 ESTs ^ « f 

456207 AA193450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi12.4 

60 422805 AA436989 Hs.121017 H2A histone family, member A 12.2 

407811 AW190902 Hs.40098 cysteine knotsuperfamily 1, BMP antagon 12.2 

407178 AA195651 Hs.104106 ESTs 12 -2 
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420931 AF044197 Hs.100431 small Inducible cytokine B subfamily (Cy 12.1 

421727 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3 12.0 

434408 AI031771 Hs.132586 ESTs 12.0 

446591 H44186 Hs.15456 PDZdomafo containing 1 11.9 

431385 BE178536 Hs.11090 membrane-spanning 4-domains, subfamily A 115 

443348 AW873596 Hs.182278 calmodulin 2 (phosphorylaSQ kinase. de!t 11.7 

416602 NM_Q06159Hs.79389 nel (chfcfeen)4*e 2 11 J 

433365 AF026944 Hs.293797 ESTs 115 

437866 AA156781 Hs.74170 metaltolhtonein 1E (fundiortaO 115 

412472 AW975398 Hs.293836 ESTs 1M 

416030 H15261 Hs.21948 ESTs 11-3 

439979 AW600291 Hs5823 hypofhelical protein FU 10430 11.3 

420757 X78592 Hs.99915 androgen receptor (dihydrotestasterone r 11.3 

411598 BE336654 Hs.70937 H3 histone family, member A 11.2 

423600 AI633559 Hs.310359 ESTs 11.2 

430770 AA765694 Hs.123296 ESTs «J0 

421037 AI684808 Hs.197653 programmed ceOdeafh 9 (POCD9) 10.9 

452461 N78223 Hs.108106 transcription tactor 10.7 

409269 AA576953 Hs.22972 hypothetical protein RJ 13352 10.6 

417791 AW965339 Hs.1 11471 ESTs 105 

447268 AI370413 Hs.36563 hypothetical protein RJ224 16 10.4 

424001 W67883 Hs. 137476 paternally expressed 10 10.4 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 10.4 

424905 NM 002497Hs. 153704 NIMA (never in mitosis gene a)-related k 10.1 
453619 H87648 Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp10.1 

442942 AW167087 Hs.131562 ESTs 10.1 

434377 AW137148 Hs.306593 Homo sapiens cDNA FU1 1382 fe, clone HE 10.1 

427217 AA399272 Hs.144341 ESTs 10.1 

445730 AI624342 Hs.170042 ESTs 10.0 

432887 AI926047 Hs.162859 ESTs 10.0 

452243 AL355715 Hs.28555 programmed ceil death 9 9.9 

424590 AW966399 Hs.46821 hypothetical protein FU20086 9.9 

432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 9.9 

438950 H23789 Hs.144530 EST 9.9 

418836 AI655499 Hs.161712 ESTs 9.8 

430291 AV660345 Hs.238126 CGI-49 protein 9.8 

444665 BE613126 Hs.47783 B aggressive lymphoma gene 9.7 
407377 C16391 gb:C16391 Clontech human aorta polyA mRN 9.7 

445413 AA151342 Hs.12677 CGI-147 protein 9.7 

443462 AI064690 Hs.171176 ESTs 9.7 

442145 AI022650 Hs.8117 erob2-interacting protein ERBIN 9.7 

435570 AF212222 Hs.1 77812 uncharacterized bone marrow protein BM04 9.7 

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6 

428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 9.6 

449448 D60730 Hs.57471 ESTs 95 

433929 AI375499 Hs.27379 ESTs 95 

432731 R31178 Hs.287820 fibronectinl " " " 9.3 

411815 AA156679 Hs.125790 leucine-rlch repeat-staining 2 9.3 

415385 R17798 Hs.7535 COBW4ike protein 9.3 

422026 U80736 Hs. 110826 trinucleotide repeat containing 9 9.2 

432596 AJ224741 Hs.278461 matrilin3 9.2 

439451 AF086270 Hs.278554 heterochromaWkfi protein 1 9.2 

423945 AA410943 gb2t32h03.fi Soares ovary tumor NbHOT H 9.1 

442432 BE093589 Hs.38178 hypothetical protein FLJ23468 9.1 
446715 AI337735 Hs.173919 ESTs. Moderately similar to ZN91.HUMAN Z 9.0 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 9.0 

437021 AI076089 Hs.292239 ESTs 9.0 

428479 V00272 Hs.184572 ce« division cycle 2, G1 to Sand G2 to 8.9 
428839 AI767756 Hs.82302 Homo sapiens cDNA FU14814 fis, done NT 8.9 
402408 NA NM_030920*:Homo sapiens hypothetical pro 8.9 

418601 AA279490 Hs.86368 catmegin 8.8 
426327 W03242 Hs.44898 Homo sapiens clone TCCCTA001 51 mRNA sequ8.8 

419519 AI198719 Hs.176376 ESTs 8.8 

440621 AW296024 Hs.150434 ESTs 85 

446142 AI754693 Hs.145968 ESTs 8.8 

418196 AI745649 Hs.26549 KIAA1 708 protein 8.7 

447178 AW594641 Hs.192417 ESTs 8.7 

322 



WO 02/059377 PCT/US02/02242 



427585 D31 152 Hs.179729 collagen, type X. alpha 1 (Schmid rnetaph 8.6 

415857 AA866115 Hs.127797 Homo sapiens cONA FU1 1381 fis, clone HE 8.6 

435061 AI651474 Hs.163944 ESTs 8.6 

431374 BE258532 Hs.251871 CTP synthase 8.4 

5 417866 AW067903 Hs.82772 collagen, type Xt, alpha 1 8.4 

437211 AA382207 Hs.5509 ecotropic viral integration site 2B 8.3 
437751 AA767373 Hs.35669 ESTs, Moderately similar to ALU1__HUMAN A 8.3 

423887 AL080207 Hs.134585 DKFZP4346232 protein 8.2 

440941 BE268362 Hs.7535 COBW-fike protein 8.2 

10 429859 NNL007O50HS.225952 protein tyrosine phosphatase, receptor t 8.2 

410193 AJ132592 Hs.59757 zinc finger protein 281 8.2 

431725 X65724 Hs.2839 Nome disease (pseudogKoma) 8.1 

446258 AI2B3476 Hs.263478 ESTs 8-1 

416747 AW876523 Hs.15929 hypothetical protein HJ12910 8.1 

15 434424 AI811202 Hs.325335 Homo sapiens cONA: FU23523 fis, clone L 8.1 

421650 AA781795 Hs.122587 ESTs 8 0 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 21 09260AB cell 8.0 

457465 AW301344 Hs.122908 DNA replication factor 8.0 

427961 AW293165 Hs.143134 ESTs 8-0 

20 436481 AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-con 8.0 

418216 AA662240 Hs.283099 AF15q14 protein 8.0 

418250 U29926 Hs.83918 adenosine monophosphate deaminase {isofo 7.9 

400285 NA Eos Control 7.9 

401464 AF039241 Hs.9028 histonedeacetyiase5 7.9 

25 407242 M18728 gb:HumaiirK»speclfccrossreactingantig 7.8 

422232 D43945 Hs.113274 transcription factor EC 7.8 

454024 AA993527 Hs.293907 hypothetical protein FU23403 7.8 

444542 AI161293 Hs.280380 aminopeptkfase 7.8 

436396 AI683487 Hs.152213 wingless-type MMTV integration site fami 7.7 

30 437204 AL110216 Hs.12285 ESTs, Weakly similar to 155214 salivary 7.6 

408805 H69912 Hs.48269 vaccinia related kinase 1 7.6 

437207 T27503 Hs.15929 hypothetical protein FU1 2910 7.6 

442818 AK001741 Hs.8739 hypothetical protein FU 10879 7.6 

426283 NM 003937 Hs. 169139 kynureninase (L-kynurenine hydrolase) 7.5 

' 3 5 424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 7.5 

446315 NM_016293Hs.14770 bridging integrator 2 7.5 

433426 H69125 Hs.133525 ESTs 7.5 

406639 M97711 gb:Human T-celi receptor (V beta 18.1, J 7.5 

420077 AW512260 Hs.87767 ESTs 7.4 

40 457332 AA961694 Hs.105187 Wnesin protein 9 gene 7.4 

422938 NM_001809Hs.1594 centromere protein A (17kD) 7.4 
447555 AI391662 Hs.160963 Homo sapiens, done MGC:1 231 8, mRNA,com7.4 
444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATI07.3 
410361 BE391804 Hs.62661 guanytate binding protein 1. interferon- 7.3 
45 400266 NA NM^003292:Homo sapiens translocated prom 7.3 
439509 AF086332 Hs.5B314 ESTs 7.3 
407771 AL138272 Hs.62713 ESTs 7.3 
407202 N58172 Hs.109370 ESTs 7.3 
433096 AU076803 Hs,282975 carboxylesterase 2 (intestine, liver) 7.2 
50 422094 AF129535 Hs.272027 F-box only protein 5 ' 7.1 
430832 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 7.1 
430287 AW182459 Hs.125759 ESTs, Weakly similar to LEU5_HUMAN LEUKE7.0 
423739 AA398155 Hs.97600 ESTs 7.0 
448212 AI475858 g b:tc87d07.x1 NCLCGAP_CLL1 Homosapiens7.0 
55 407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0 
454440 BE062906 Hs.28338 WAA1546 protein 7.0 
444783 AK001468 Hs,62180 aniOin (Drosophila Scraps homolog). act 7.0 
421373 AA608229 Hs.167771 ESTs &9 
431960 AW241821 Hs.301927 c6.1A 6.9 
60 424704 AI263293 Hs.1 52096 cytochrome P450. subfamily lU{arachido 6.8 
449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.8 
439840 AW449211 Hs.105445 GDNF family receptor alpha 1 6.8 
414080 AA135257 Hs.47783 B aggressive lymphoma gene 6.8 
441243 A1767056 Hs.193002 ESTs 6J 
65 408380 AF123050 Hs.44532 diubiquitln 6.7 
422956 BE545072 Hs.122579 hypothetical protein FU 10461 6.7 
446651 AA393907 Hs.97179 ESTs 6.7 
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419839 U24577 Hs.93304 phospho!ipaseA2 l group VII (plateiet-ac 6.7 

437740 AA810265 Hs.122915 ESTs 6.7 

421582 A1910275 Hs.1406 trefoil factor 1 (pS2> 6.7 

427356 AW023482 Hs.97849 ESTs 6.6 

5 429597 NM_003816Hs.2442 a disinlegrin and metalloproteinase doma 6.6 

422634 NM_016010Hs.118821 CGl-62 protein 6.6 

421072 AI215069 Hs.89113 ESTs 6.5 

427718 A1798680 Hs.25933 ESTs 6.5 

411000 N40449 Hs^01619 ESTs, Weakly similar to S33383 SEB4B pro 6.5 

10 449343 AI151418 Hs.272458 protein phosphatase 3 {formeriy 2B), cat 6.4 

409757 NM_001898Hs.123114 cystatinSN 6.4 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 6.4 

456938 X52509 Hs. 161 640 tyrosine aminotransferase 6.4 

418848 AI820961 Hs.193465 ESTs 6.4 

15 424902 NM_003866Hs.153687 inositol polyphosphate-4 -phosphatase, ty 6.4 

452838 U65011 Hs.30743 preferentially expressed antigen in mela 6.4 

439452 AA918317 Hs.57987 B^Jl CLUIymphorna 118 (anc finger pro 6.4 

407266 AJ235664 gbiHomo sapiens mRNA for Immunoglobulin 6.3 

411078 AI222020 Hs.182364 CocoaCrisp 6.3 

20 433001 AF217513 Hs.279905 done HQ0310 PRO0310p1 6.3 

434340 A1193043 Hs.128685 ESTs, Weakly similar to T17226 hypothetl , 6.2 

429503 AA394183 Hs.26873 ESTs 6.2 

402578 C1001134:gl|21 17372|plr|p65981 fatty ac 6.2 

409646 AW161391 Hs.709 deoxycytidine kinase 6.1 

25 430447 W17064 Hs.332848 SW1/SNF related, matrix associated, act! 6.1 

432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6.1 

443709 AI082692 Hs.134662 ESTs 6.1 

420929 A1694143 Hs.296251 programmed cell death 4 6.1 

428368 BE440042 Hs.83326 matrix metalloprotelnase 3 (stromelysin ' 6.1 

30 428248 AI126772 Hs.40479 ESTs 6.0 

420344 BE463721 Hs-97101 putative G proteirv-coupled receptor 6.0 

453392 U23752 Hs.32964 SRY (sex determining region Y)hbox 11 6.0 

425397 J04088 Hs.156346 topobomerase (DNA) II alpha (170kD) 6.0 

418007 M13509 Hs.83169 matrix metalloproteinase 1 (MMP1 ; inters 6.0 

35 428585 AB007863 Hs.185140 KIAA0403 protein 6.0 

437608 AA761605 Hs.292308 ESTs, Weakly similar to ALU 1JHUMAN ALU S6.0 

427408 AA583206 Hs.2156 RAR-related orphan receptor A 6.0 

406687 M31126 Hs.272620 matrix metalloproteinase 11 (MMP11;stro 6.0 

418092 R45154 Hs.106604 ESTs 6.0 

40- 447051 AW139130 Hs.160951 ESTs, Weakly similar to Con1 [H.sapiens] 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

432239 X81334 Hs.2936 matrix metalloproteinase 13 (coflagenase 6.0 

435106 AA100847 Hs.193380 ESTs, Highly similar to AF 174600 1 F-box 5.9 

435525 AI831297 Hs.123310 ESTs 5.9 

45 458809 AW972512 Hs.20985 sin3-associated polypeptide, 30kD 5.9 
410785 AW803341 gb:IU4JM0079^9(W00-050-O03 UM0079 Homo5.9 

422576 BE548555 Hs.1 18554 CGI-83 protein 5.9- 

451398 AI793124 Hs.144479 ESTs 5.9 

441881 AW968904 Hs.179568 hypothetical protein FU22624 5.8 

50 412022 A1005043 Hs.24143 Wiskott-Aldrich syndrome protein interac 5.8 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxyfic 5.8 

447350 AI375572 Hs.172634 ESTs 5.8 

434094 AA305599 Hs.238205 hypothetical protein PRO20 13 5.8 

409151 AA306105 Hs.50785 SEC22, veside trafficking protein (S. c 5.8 

55 448807 AI571940 Hs,7549 ESTs 5.8 

452281 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 fis. clone PL 5.8 

421281 AI299139 Hs.17517 ESTs 5.8 

430361 AI033965 Hs.239926 sterol-C4-methyl oxidase-like 5.8 

400289 X07820 Hs.2258 matrix metalloproteinase 10 (MMP10; sir 5.7 

60 440527 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha 5.7 

434674 AA831879 Hs.136985 ESTs 5.7 

426320 W47595 Hs.169300 transforming growfil fector, beta 2 5.7 

452401 NM_007115Hs.29352 tumor necrosis factor, alpha-Induced pro 5.7 

448663 BE614599 Hs.106823 hypothetical protein MGC14797 57 

65 438199 AW016531 Hs.122147 ESTs 5.7 

446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7 

428336 AA503115 Hs.183752 microseminoprotein, beta- 5.6 
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430379 AF134149 Hs.240395 potassium channel, subfamily K, member 6 5.6 

422835 BE218705 Hs.121378 metallothioneln-like 5, tesfe-speciftc 5.6 

444758 AL044878 Hs.11899 3-hydroxy-3-me%!g(utaryWoen2ymeAre 5.6 

443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.6 

5 400301 X03635 Hs.1657 estrogen receptor 1 5.6 

447078 AW885727 Hs.301570 ESTs 5.6 
432015 AL157504 Hs.159115 Homo sapiens mRNA; cONA DKFZp586O0724 {(5.5 

438691 AA906288 Hs.212184 ESTs 5.5 

439809 R41396 Hs.101774 hypothetical protein FU23045 5.5 

10 415786 AW419196 Hs.257924 hypothetical protein FU 13782 5.5 

456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 5.5 
401645 MA C16001440*:gil12330704lgblAAG52890.1|AF35.5 

437967 BE277414 Hs.5947 mel transforming oncogene (derived from 5.5 

445885 AI734009 Hs.127699 KIAA1 603 protein 5.4 

15 439138 AI742605 Hs.193696 ESTs 5.4 

440270 NM.015986Hs.7120 cytokine receptor-like molecule 9 5.4 

437536 X91221 Hs.144465 ESTs 5.4 

438167 R28363 Hs.24286 ESTs 5.4 

452741 BE392914 Hs.30503 Homo sapiens cDNA FLJ1 1344 lis, clone PL 5.4 

20 426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7JHUMAN A 5.4 

413554 AA319146 Hs.75426 secretogranin II ((^romogranin C) 5.4 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.4 

434263 N34895 Hs.44648 ESTs 5.4 

446382 AW205168 Hs.150823 ESTs 5.4 

25 422406 AF025441 Hs.116206 Opa-interacting protein 5 5.3 

438321 AA576635 Hs.6153 CG1-48 protein 5.3 

418310 AA814100 Hs.86693 ESTs 5.3 

419625 U91616 Hs.91640 nuclear factor of kappa light poiypeptid 5.3 

450701 H39960 Hs.288467 Homo sapiens cDNA RJ12280 fis, clone MA 5.3 

30 445900 AF070526 Hs.13429 Homo sapiens clone 24787 mRNA sequence 5.2 

449051 AW961400 Hs.333526 HER2 receptor tyrosine kinase (c-erW>2, 5.2 

418758 AW959311 Hs.172012 hypothetical protein DKFZp434J037 5.2 

431070 AW408164 Hs.249184 transcription factor 19 (SC1) 5.2 

417079 U65590 Hs.81134 interleukin 1 receptor antagonist 5.2 

35 421928 AF013758 Hs.109643 polyadenylate binding protein-interaciin 5.2 

428804 AKD00713 Hs.193736 hypothetical protein FU20706 5.2 

427427 AF077345 Hs.177936 ESTs 5.2 

403485 C3Q01813*:gi|12737279|ref|XPJ)12163.1|k 5.2 

422168 AA586894 Hs.1 12408 S100 calcium-binding protein A7(psorias 5.1 

40 421937 AI878857 Hs.109706 hematological and neurological expressed 5.1 

426752 X69490 Hs.172004 titin 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.1 

423198 M81933 Hs.1634 ceil division cycle 25A 5.1 

412281 AI810054 Hs.14119 ESTs 5.1 

45 447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 5.1 

453931 AL121278 Hs.25144 ESTs 5.1 

404347 TaigetExon 5.1 

431808 M30703 Hs.270833 anphiregulin{schwannoma-derived growth 5.1 

429113 D28235 Hs.195384 prostaglandin-endoperoxide synthase 2 (p 5.1 

50 436291 BE568452 Hs.5101 protein regulator of cyloWnesis 1 5.1 

450603 R43646 Hs.12422 ESTs 5.1 

434725 AK000796 Hs.4104 hypothetical protein 5.0 

435981 H74319 Hs.188620 ESTs 5.0 

407376 AA993138 Hs,1 42287 ESTs, Weakly similar to ALUF_HUMAN !1!1 5.0 

55 431689 AA305688 Hs.267695 UDP-Gal:belaGlcNAc beta 1,3-galactosyltr 5.0 
405348 NA C7001664:gi|12698061ldbHBAB21849.1|(AB 5.0 
436196 AK001084 Hs.333498 Homo sapiens cDNA FU10222 fis, clone HE 5.0 

437065 AL036450 Hs.103238 ESTs 5.0 

410196 A1936442 Hs.59838 hypothetical protein FU 10808 5.0 

60 429412 NM_.006235Hs.2407 POU domain, class 2, associating factor 5.0 

446619 AU076643 Hs.313 secreted phosphoprotein 1 {osteopontin, 4.9 

403329 NA TaigetExon 4.9 

442875 BE623003 Hs.23625 Homo sapiens done TCCCTA001 42 mRNA sequ4,9 

442441 A1820662 Hs.129598 ESTs 4.9 

65 430375 AW371048 Hs.93758 H4 histone family, member H 4.9 
424128 AW966163 gb:EST378236 MAGE resequences, MAGI Horno4.9 

408873 AL046017 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 4.9 
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407910 
432606 
453204 
452020 
449048 
408369 
431645 
423575 
444246 
421524 
452827 
414222 
456086 
419078 
418973 
447033 
451621 
419968 
424326 
431585 
429294 
416814 
439897 
429687 
422880 
405801 
432435 
439544 
425354 
436027 
424623 
403366 
402542 
450193 
411678 
456844 
448072 
408045 
423782 
447388 
448140 
452561 
425331 
428801 
428500 
426075 
437259 
400409 
412863 



AA650274 Hs.41296 
NM.002104Hs.3066 
R10799 Hs.191990 
AA722012 Hs.255757 
Z45051 Hs.22920 
R38438 Hs.182575 
AF078849 Hs.266483 
C18863 Hs.163443 
H93281 Hs.10710 
AA312082 Hs.105445 
AI571835 Hs.55468 
AL135173 Hs.878 
AL161999 Hs.77324 
M93119 Hs.89584 
AA233056 Hs.191518 
AI357412 Hs.157601 
AI879148 Hs.26770 
X04430 Hs.93913 
NWL014479HS.145296 
BE242803 Hs.262823 
AA095971 Hs.198793 
AW192307 Hs.80042 
NMJ15310Hs.6763 
AI675749 Hs.211608 
AF228704 Hs.121524 

BE218886 Hs.282070 
W26354 Hs.28891 
U62027 Hs.155935 
A1864053 Hs.39972 
AW963062 Hs.337404 
NA 



AI916071 Hs.15607 
AI907114 Hs.71465 
AI264155 Hs.152981 
A1459306 Hs.24908 
AW138959 Hs.245123 
A1472209 Hs.323117 
AW630534 Hs.76277 
AF146761 Hs.20450 
A1692181 Hs.49169 
AW962128 

AW277121 Hs.254881 
AI815395 Hs.184641 
AW513691 Hs.270149 
A1377755 Hs.120695 
AF153341 Hs.283954 

AA121673 Hs.59757 

50 426989 AI815206 Hs.99395 



fibronectin leucine rich transmembrane p 4.9 

granzyme K (serine protease, granzyme 3; 4.9 

ESTs 4.8 
ESTs, Weakly similar to AT2A.HUMAN POTEN4.8 

similar to S68401 (cattle) glucose Indue 4.8 

solute carrier family 15 (H?7? transport 4.8 

dynein light cha!n-A 4.8 

Homo sapiens cONA RJ11576 fis, done HE 4.8 

hypothetical protein RJ20417 4.8 

6DNF family receptor alpha 1 4.8 

ESTs 4.8 

sorbitol dehydrogenase 4.8 

eukaryotic translation termination facto 4.8 

insulinoma-associated 1 4.8 

ESTs 4.8 

ESTs 4.8 

fatty add binding protein 7, brain 4.7 

Interieukin 6 (Interferon, beta 2) 4.7 

disintegrin protease 4.7 

hypothetical protein FU10326 4.7 

Homo sapiens cONA: FU22463 fis, clone H 4.7 

do«chyW>^lcMan9GlcNAc2-PP-dQtichylgl 4.7 

K1AA0942 protein 4.7 

nudeoporin 153kD 4.7 

glutathione reductase 4.7 
NMJW0390:Homo sapiens choroideremia (Ra4.6 

ESTs 4.6 

hypothetical protein FU 1 1 360; artemis p 4.6 

complement component 3a receptor 1 4.6 

ESTs, Weakly similar to I38588 reverse t 4.6 

ESTs 4.6 

Target Exon 4.6 

Target Exon 4.6 



401866 
418819 
406348 
412138 
428550 
411743 



423291 
423456 
452190 
424871 
429575 
429922 
421379 
400300 
437258 
446595 



AA228776 Hs.191721 

AW895387 
AW297880 Hs.98661 
AW862214 

BE081342 Hs.283037 
NMJ)04129Hs.126590 
AL110151 Hs.128797 
H26735 Hs.91668 
NMJ04525HS.1 53595 
AA706003 Hs.99387 
Z97630 Hs.226117 
Y15221 Hs.103982 
X03363 

AL041243 Hs.174104 
T57448 Hs.15467 



Homo sapiens Fanconl anemia complementat 4.6 
squalene epoxidase 4.6 
COP-diacytglycerot synthase (phosphatida 4.6 
ESTs 4.5 
ESTs 4.5 
ESTs 4.5 
Homo sapiens, clone MGC:9381, mRNA, comp4.5 
BCM-like membrane protein precursor 4.5 
KIAA1634 protein 4.5 
gb:EST374201 MAGE resequences, MAGG Homo4.5 
ESTs 4.5 
fatty acid desaturase 2 4.5 
ESTs, Weakly similar to 2109260A B cell 4.4 
ESTs 4.4 
Homo sapiens winged herix/forkhoad uans 4.4 
zinc finger protein 281 4.4 
ESTs 4.4 
Target Exon 4.4 
ESTs 4.4 
Target Exon 4.4 
gb:GV4-NN0038^3Q030Q-157-c10 NN0038 Homo4.4 
ESTs 4.4 
gb:QV4-CT0361-3Q1299474-b05 CT0361 Homo4,4 
HSPC039 protein 4.4 
guanylate cyclase 1 , soluble, beta 2 4.4 
DKFZP586D0824 protein 4.4 
Homo sapiens done PP1498 unknown mRNA4.4 
low density Bpoproteirwelated protein 43 
ESTs 4.3 
H1 hlstone family, member 0 4.3 
small Inducible cytokine subfamily B(Cy 4.3 
HER2 receptor tyrosine kinase {c-erW>2. 4.3 
ESTs 4.3 
hypothetical protein FU20725 4.3 
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403011 NA ENSP0O00O21533O*:Probabieserine/threoni 4.3 

419055 AI365384 Hs.11571 Homo sapiens cDNA FU1 1570 fis, clone HE 4.3 

418661 NMJ01949HS.1189 E2F transcription factor 3 4,3 

407786 M687538 Hs.38972 tetraspan 1 4.3 

5 429183 AB014604 Hs.197955 K1AA0704 protein 4.3 

442914 AW188551 Hs.99519 hypothetical protein RJ 14007 4.3 

441029 AI091795 Hs.179246 ESTs 4.3 

452194 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 4.3 

414821 M63835 Hs.77424 Fc fragment of IgG, high affinity la, re 4.2 

10 410102 AW248508 Hs.279727 Homo sapiens cONA FU1 4035 fis, done HE 4.2 

452110 T47667 Hs.28005 Homo sapiens cDNA FU1 1309 fis, done PL 4.2 

442007 AA301116 Hs.142838 nudeolar phosphoprotein Nopp34 4.2 

417318 AW953937 Hs.12B91 ESTs 4.2 

431818 AW510444 Hs.191705 ESTs, Weakly similar to T47184 hypolhefi 4.2 

15 443646 AI085198 Hs.164226 ESTs 4.2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 4.2 

446839 BE091926 Hs.16244 mitotic spindle coiled-coii related prat 42 

423242 AL039402 Hs.125783 DEME-6 protein 4.2 

432116 AA902953 Hs.308538 ESTs 4.2 

20 409038 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 4.2 

445625 BE246743 Hs.288529 hypothetical protein FU22635 4.2 

425139 AW630488 Hs.325820 protease, serine, 23 4.2 

447397 BE247676 Hs.16442 E-1 enzyme 4.2 

410166 AK001376 Hs,59346 hypothetical protein FU 1051 4 4.1 

25 437295 AW779318 Hs.88417 ESTs 4.1 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.1 

441790 AW294909 Hs.132208 ESTs 4.1 

410129 BE244074 Hs.58831 regulator of Fas-induced apoptosis 4.1 

427521 AW973352 Hs.290585 ESTs 4.1 

30 425247 NM_005940Hs.155324 matrix metalloproteinase 11 (MMP11;stro 4.1 

412866 AF041163 Hs.74647 Human T-ceB receptor active alpha-chain 4.1 
441153 BE562826 gb:601336534F1 NIHMGCJ4 Homo sapiens c4.1 



AAA'ifM Al/nnnilC U<. m7Rn wnnrin n RP Hat« 11 4,1 

426711 AA383471 Hs.180669 conserved gene amplified in osteosarcoma 4.1 

35 405850 NA Target Exon 4.1 

440283 AI732892 Hs.190489 ESTs 4.0 

432441 AW292425 Hs.163484 ESTs 4.0 

400284 NA estrogen receptor 1 4.0 

417341 N91453 Hs.102987 ESTs 4.0 

40 429732 U20158 Hs.2488 lymphocyte cytosoDc protein 2 (SH2 doma 4.0 

411393 AW797437 Hs.69771 B-factor, properdin 4.0 

425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 4.0 

419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 4.0 

419092 J05581 Hs.89803 mucin 1, transmembrane 4.0 

45 443147 A1034351 Hs.19030 ESTs 4.0 

408633 AW963372 Hs.46677 PRO2000 protein 4,0 

433404 T32982 Hs.102720 ESTs 4.0 

421506 BE302796 Hs.105097 thymidine kinase 1. soluble 4.0 

417900 BE250127 Hs.82906 CDC20 (cell division cyde 20, S. cerevi 3.9 

50 414602 AW630088 Hs.76550 Homo sapiens mRNA; cONA DKFZp564B1264 (f3.9 

413762 AW411479 Hs.848 FK506-binding protein 4 (59kD) 3.9 

404580 NM_014112*:Homo sapiens trichorhinophala 3.9 

452046 AB018345 Hs.27657 KIAA0802 protein 3.9 
459587 AA031956 gb:zk15e04.s1 Soaresj>regnanUiterus_NbH3.9 

55 416658 U03272 Hs.79432 fibrillin 2 {congenital contractural ara 3.9 

426647 AA243464 Hs.294101 pre-8-cell leukemia transcription factor 3.9 

429353 AL1 17406 Hs.200 102 ATP-binding cassette transporter MRP8 3.9 

419038 AW134924 Hs.190325 ESTs 3.9 

418918 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell 3.9 

60 421977 W94197 Hs.110165 ribosomal protein L26 homolog 3.9 

442567 AI201183 Hs.130251 ESTs 3.9 

421168 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 3.9 
431 701 AW935490 Hs. 14658 Human chromosome 5q13. 1 done 5G8 mRNA3.9 

418526 BE019020 Hs.85838 solute carrier family 16 (monocarboxylic 3.9 

65 414998 NM_002543Hs.77729 oxidised low density lipoprotein (lectin 3.9 

422790 AA809875 Hs.25933 ESTs 3.9 

419741 NMJ07Q19HS.93002 ubiquitin carrier protein E2-C 3.9 
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430017 AA263172 Hs.35 protein tyrosine phosphatase, non-recept 3.9 

458814 AI498957 Hs.170861 ESTs, Weakly similar to Z195_HUMAN ZINC 3.8 

428514 AW236881 Hs.193139 ESTs 3.8 

434521 NH.002267Hs.3886 karyopherin alpha 3 fimportln alpha 4) 3.8 

5 409425 U40462 Ks.54452 zinc finger protein, subfamily 1A, 1 (Ik 3.8 

439560 BE565647 Hs.74899 hypothetical protein RJ12820 3.8 

424028 AF055084 Hs.153692 HornosapienscDNAFU14354 fis, done Y7 3.8 

400021 AFFX control - HUM1SGF3AM97935_MA 3.8 

453403 BH466639 Hs.61779 Homo sapiens cDNAFUl 359 1 fc, clone PL 3.8 

10 445941 AI267371 Hs.172636 ESTs 3.8 

434378 AA631739 Hs.335440 EST 3.8 

429220 AW207206 Hs.136319 ESTs 3.8 

439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 3.8 

401045 C11001883 , :gi|6753278IreflNP_033938.1|c 3.8 

15 430178 AW449612 Hs.152475 ESTs 3.8 

423397 NM_001838Hs.1652 chemokine (C-C motif) receptor 7 3.8 

447630 AI660149 Hs.44865 lymphoid enhancer-binding factor 1 3.8 

436391 AJ227892 Hs.146274 ESTs 3.8 

413011 AW068115 Hs.821 blglycan 3.8 

20 422121 AI767949 Hs.179833 ESTs 3.8 

452268 NM_003512Hs.28777 H2A histone family, member L 3.8 

427811 M81057 Hs.180884 carboxypeptldase B1 (tissue) 3.6 

415579 AA165232 Hs.222069 ESTs 3.8 

437330 AL353944 Hs.501 15 Homo sapiens mRNA; cDNA DKFZp761 J11 12 (f 3.8 

25 427122 AW057736 Hs.323910 HER2receptor tyrosine kinase (cerb-b2, 3.7 

400286 NA C16000922:gi[7499103(pin;iT20903hyrjothe 3.7 

420281 AI623693 Hs.1 91533 ESTs 3.7 

419926 AW900992 Hs.93796 DKFZP586D2223 protein 3.7 . 

417541 AI992191 Hs.180040 hypothetical protein FU22439 37 

30 426172 AA371307 Hs.125056 ESTs 3.7 

429638 AJ916662 Hs.21 1577 kinectin 1 (fanesin receptor) 3.7 

457001 J03258 Hs.2062 vitamin D (1,25- dihydroxyvitamin 03) re 3.7 

424109 AW406878 gb:Ui-HF-BL0-adg-g-06-tWJl.r1 NIH_MGCJ73.7 

417022 NM_014737Hs.80905 Ras association (RaIGDS/AF-6) domain fam 3.7 

35 436222 AI208737 Hs. 122810 Homo sapiens cONAFU 11489 lis, clone HE 3.7 

430448 AI633553 Hs.13303 Homo sapiens cDNA: FU21784 fis. clone H 3.7 

432729 AK000292 Hs.278732 hypothetical protein FU20285 3.7 

413916 N49813 Hs.75615 apolipoprotein C-ll 3.7 

421662 NM_014141Hs.106552 cell recognition molecule Caspr2 3.7 

40 441633 AW958544 Hs.1 12242 normal mucosa of esophagus specific 1 3.7 

408761 AA057264 Hs.238936 ESTs, Weakly similar to (deffine not ava 3.7 

406153 Target Exon 3.7 

445563 AW873606 Hs.149006 ESTs 3.7 

453464 AI884911 Hs.32989 receptor (calcitonin) activity modifying 3.7 

45 446918 AB011152 Hs.22572 KIAA0580 protein 3.7 

413936 AF113676 Hs.297681 serine (or cysteine) proteinase fnhblto 3.6 

448069 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.6 

453313 BE005771 Hs.153746 hypothetical protein FU22490 3.6 

425234 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti 3.6 

50 419941 X98654 Hs.93837 phosphatWyfinositol transfer protein, m 3.6 

402397 AF188625 Hs.189507 phosphofipase A2. group IfD 3.6 

430378 229572 Hs.2556 tumor necrosis factor receptor superfami 3.6 

448106 AI800470 Hs.171941 ESTs 3.6 

426431 NM.000458HS.169853 transcription factor 2, hepatic; LF-B3; 3.6 

55 431843 AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypotheti 3.6 

426878 BE069341 gb:QV3^T038 1 -270100073^08 BT0381 Homo3.6 

434061 AW024973 Hs.283675 NPD009 protein 3.6 

445292 AV653264 Hs.1 3982 Homo sapiens cDNA FU14666 fis, done NT 3.6 

452101 T60298 Hs. 10844 Homo sapiens cDNA FUI 4476 fis. done MA 3.6 

60 427581 NNL014788HS.179703 KIAA0 129 gene product 3.6 

409047 AW961434 Hs.31539 ESTs 3.6 

416820 NM.000402HS.80206 glucose-6-phosphate dehydrogenase 3.6 

410386 W26187 Hs.3327 Homo sapiens cONA: FU22219 fe, clone H 3.6 

440516 S42303 Hs.161 cadherin 2, type 1, N-cadherin (neuronal 3.6 

65 434360 AW015415 Hs.127780 ESTs 3.6 

428970 BE276891 Hs.194691 retinoic acid induced 3 3.6 

415079 R43179 Hs.22895 hypothetical protein FU 23548 3.6 
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409619 AK001015 Hs.55220 BCL2-assoctated alhanogene 2 3.6 

430044 AA464510 Hs.152812 ESTs 3.6 

430829 AW451999 Hs.194024 ESTs 3.6 

434224 AA380731 Hs.84 interleukin 2 receptor, gamma (severe co 3.6 

5 439247 AFQ88020 Hs.46767 EST 3.6 

431542 H63010 Hs.5740 ESTs 3.5 

430713 AA351647 Hs.2642 eukaryotic translalion elongation factor 3.5 

434988 AI418055 Hs.161160 ESTs 3.5 

437748 AF234882 Hs.5814 suppression of tumorigenicity 7 3.5 

10 418322 AA284166 Hs.84113 cydln-dependent kinase Inhibitor 3 (COK 3.5 

439569 AW602166 Hs.222399 CE6P1 protein 3.5 
459583 A1907673 gb:l-BT152-080399-004BT152Homosapien3.5 

403212 NM_019595:Homo sapiens intersedin 2 (IT 3.5 

409099 AKD00725 Hs.50579 hypothetical protein RJ20718 3.5 
15 " 453968 AA847843 Hs.62711 Homo sapiens, done IMAGE:3351 295, mRNA3.5 

436338 W92147 Hs.118394 ESTs 3.5 

422890 Z43784 Hs.75893 ankyrin3,nodeofRanvier(ankyrinG) 3.5 

442295 A1827248 Hs.224398 Homo sapiens cDN A FU 11 469 lis, clone HE 3.5 

417975 AA641836 Hs.30085 hypothetical protein FU23186 3.5 

20 433730 AK002135 Hs.3542 hypothetical protein FLJ1 1273 3.5 

439926 AW014875 Hs.137007 ESTs 3.5 

445873 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-1 3.4 

410153 BE311926 Hs.15830 hypothetical protein FU12691 3.4 
422128 AW881145 gb:QV(K)T0033^10400-182-a07 OT0033 Homo 3.4 

25 414921 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.4 

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 3.4 

444564 AI167877 Hs.143716 ESTs 3.4 

402470 NA Target Exon 3.4 

418120 AA213437 Hs.192249 ESTs 3.4 
30 422414 AW875237 Hs.13701 ESTs 3.4 
433345 AI681545 Hs.152982 hypothetical protein FU1 31 17 3.4 
409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4 
422611 AA158177 Hs,1 18722 fucosyltransferase 8 (alpha (1,6) fucosy 3.4 
423554 M90516 Hs.1674 glutamine4nj<^e-6^hosphate transamln 3.4 
35 442619 AA447492 Hs.20183 ESTs, Weakly similar to AF164793 1 prate 3.4 
402359 NA C19001991*^il12656111|gb|AAK00751.1|AF23.4 
439398 AA2B4267 Hs.221504 ESTs 3.4 
415208 F01020 Hs.172004 tifin 3.4 
452853 AA812633 Hs,10845 ESTs 3.4 
40 429345 R11141 Hs.199695 hypothetical protein 3.4 
449027 AJ271216 Hs,22880 dlpeptidylpeptidase 111 3.4 
412115 AK001763 Hs,73239 hypothetical protein FU10901 3.4 
432180 Y18418 Hs.272822 RuvB (E coli homobg)-1ike 1 3.4 
428977 AK001404 Hs.194698 cydinB2 3.4 
45 431611 U58766 Hs.264428 tissue spedfic transplantation antigen 3.4 
418286 AA622528 Hs.319825 Homo sapiens, done IMAGE361 6574. mRNA,3.4 
436895 AF037335 Hs.5338 carbonic anhydrase XII (tumor antigen H 3.4 
443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunit, 3.4 
428450 NNL014791HS.184339 KIAA01 75 gene product 3.3 
50 449571 AW016812 Hs.200266 ESTs 3.3 
412777 AI335773 Hs.270123 ESTs 3.3 
420542 NMJW0505HS.1321 coagulation factor XII (Hageman factor) 3.3 
412754 AW1 60375 Hs.74565 amyloid beta (A4) precursor-like protein 33 
418327 U70370 Hs.84136 paired-like homeodomain transcription fa 3.3 
55 449065 AI627393 Hs.258998 ESTs, Weakly similar to high mobility gr 3.3 
425999 AW513051 Hs.332981 ESTs, Weakly simflar to 138022 hypotheti 3.3 
430280 AA361258 Hs.237868 Interleukin 7 receptor 33 
407777 AA161071 Hs.71465 squatene epoxidase 3.3 
426516 BE262660 Hs.1 70197 glutamic-oxaloacetic transaminase 2, mit 3.3 
60 414361 AI086138 Hs.204044 ESTs 3.3 
427080 AW068287 Hs.173466 ras-related C3 botullnum toxin substrate 3.3 
426429 X73114 Hs.169849 myosin-binding protein C, slow-type 3.3 
446163 AA026880 Hs.25252 prolactin receptor 3.3 
428566 U41763 Hs.184916 clathrin, heavy polypeptide-fflce 1 3.3 
65 418641 BE243136 Hs.86947 a disintegrin and metalloproteinase doma 3.3 
436293 AI601188 Hs.120910 ESTs 3.3 
411257 AA628967 Hs.115274 ESTs, Highly similar to IHH..HUMAN INDIAN 3.3 
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430253 AK001514 Hjl236844 hypothetical protein FU10652 3.3 

430066 A1929659 Hs.237825 signal recogniflon partoe 72kD 3.3 

436469 AK001455 Hs.5198 Down syndrome critical region gene 2 3.3 

437786 BE142681 Hs.155573 polymerase {DNA directed), eta 3.3 

5 444079 H09048 Hs.23606 ESTs 3.3 

457183 H91882 Hs.1 18569 Dvl-binding protein IDAX {Inhibition of 3.3 
431215 AA496078 Hs.121554 Human DNA sequence from clone RP11 -21 8C 13.3 

424563 AA446932 Hs.151428 ret finger protein 2 3.3 

450828 AW270655 Hs.193804 ESTs 3.3 

10 408652 R43409 Hs.6829 Homo sapiens mRNA for KIAA1644 protein, 3.3 

445142 AW978484 Hs.93842 Homo sapiens cDNA: FU22554 fis, clone H 3.3 
426761 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586l2022 (0.3 

439237 AW408158 Hs.318893 ESTs, Weakly simflar to A47582 B-ceUgr 3.3 

422616 BE300330 Hs.1 18725 setenophosphate synthetase 2 3.3 

15 443247 BE614387 Hs.333893 c-Myc target JP01 3.3 

406663 U24683 Hs,302063 immunoglobulin heavy constant mu 3.3 

434137 AA907734 Hs.124895 ESTs 3.3 

408877 AA479033 Hs.130315 ESTs, WeaWy similar to A47582 B-ceUgr 3.3 

439101 C01765 Hs.38750 hypothetical protein FU 11 526 3.3 

20 408221 AA912183 Hs.47447 ESTs 3.3 

447519 U46258 Hs.339665 ESTs 3.3 

404755 NA Target Exon 3.3 

451871 AI821005 Hs.1 18599 ESTs 3.2 

420319 AW406289 Hs.96593 hypothelica! protein 3.2 

25 430580 AA806105 Hs.300697 immunogloburin heavy constant gamma 3 (G 3.2 
400202 NA NfvL002795 # :Homo sapiens proteasome (pros32 
400222 NA NMJ)02082*;Homo sapiens G protein-couple 3.2 

425988 BE045897 Hs.274454 ESTs, Weakly simflar to I38022 hypotheti 3.2 

458098 BE550224 Hs.74170 metallothionein1E(functionaO 3.2 

30 430589 AJ002744 Hs.246315 UDP^^acetyU!pha-D^galactosamine:poryp 3.2 

431563 AI027643 Hs.120912 ESTs 3.2 
442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.2 

422309 U79745 Hs.1 14924 solute carrier family 16 (monocarboxytic 3.2 

419703 AI793257 Hs.128151 ESTs 3.2 

35 420380 AA640891 Hs.102406 ESTs 3.2 

410853 H04588 Hs.30469 ESTs 3.2 

454417 AI244459 Hs. 110826 trinucleotide repeat containing 9 3.2 
432745 AI821926 gtxnt78K)5.x5 NCI_CGAP_Pr3 Homo sapiens 3.2 

422032 AA476966 Hs.1 10857 polymerase (RNA) 111 (DNA directed) poly 3.2 

40 415339 NM_015156Hs.78398 K1AA0071 protein 3.2 

426384 AI472078 Hs.303662 ESTs 3.2 

448030 N30714 Hs.325960 membrane-spanning 4-domains, subfamily A 3.2 

418739 AA310964 Hs.88012 SHP2 interacting transmembrane adaptor 3.2 
442053 R35343 Hs.24968 Human DNA sequence from clone RP1-233G163.2 

45 434747 AA837085 Hs.220585 ESTs 3.2 
427297 AW292593 Hs.334907 Homo sapiens, clone MGC:17333, mRNA, com3.2 

412228 AW503785 Hs.73792 complement component (3d/Epstein Barrvl 3.2 

452304 AA025386 Hs.61311 ESTs. Weakly similar to S10590 cysteine 3.2 

453953 AW408337 Hs.36972 CD7 antigen (p41) 3.2 

50 407758 D50915 Hs.38365 K1AA0125 gene product 3.2 

451149 AL047586 Hs.10283 RNA binding motif protein 8B 3.2 

430015 AW768399 Hs.112157 ESTs 3.2 

433313 W20128 Hs.296039 ESTs 3.2 

418334 AA319233 Hs.5521 ESTs 3.2 

55 450223 AA418204 Hs.241493 natural killer-tumor recognition sequenc 3.2 

454365 AW966728 Hs.54642 methionine adenosyltransferase II, beta 3.2 
451128 AL118668 gb£)KF2p761 1031 Ojl 761 (synonym: hamy2)3.2 

417793 AW405434 Hs.82575 small nuclear ribonucleoprotein polypept 3.2 

428027 U22029 Hs.334345 cytochrome P450, subfamily 1IA (phenobar 3.2 

60 441197 8E244638 Hs.166 sterol regulatory element binding transc 3.2 

424634 NMJ03613HS.151407 cartilage intermediate layer protein, nu 3.2 

419986 AI345455 Hs.78915 GA-binding protein transcription factor, 3.2 

416714 AF283770 Hs.79630 CD79A antigen fimmunoglobulin-associated 3.2 

449465 NMJ104380HS.23598 CREB binding protein (Rubinstein-Taybi s 3.2 

65 422166 W72424 Hs.1 1 2405 S1 00 calchim-birKJIng protein A9 (calgran 3.2 

409079 W87707 Hs.82065 interleukin 6 signal transducer (gp1 30, 3.2 

423551 AA327598 Hs.233785 ESTs 3.2 
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453553 M036849 Hs.61829 Homo sapiens cDNA aJ 12763 fis, done NT 3.2 
442580 AI733682 Hs.1 30239 ESTs t 3.2 

458079 A179687Q Hs.54277 DNA segment on chromosome X (unique) 9923.2 
425700 AF076292 Hs.159251 forkhead box H1 3.2 
5 417124 BE1 22762 Hs.25338 ESTs 3.2 
407104 S57296 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 3,2 
442215 AI703172 Hs. 129005 ESTs, Weakly similar to 21 09260ABceH 3.1 
430271 T06199 Hs.237506 DnaJ (Hsp40) homolog, subfamily B, membe 3.1 
425317 AW205118 Hs.210546 imerfeuWn21 receptor 3.1 
10 426095 A1278023 Hs.89986 ESTs 3,1 
442313 BE388898 Hs.8215 hypothetical protein RJ1 1307 3.1 
424709 AL137589 Hs.152149 hypothetical protein DKFZp434K0410 3.1 
429671 BE379335 Hs.211594 proteasome (prosome, macropacn) 26S subu 3.1 
432715 AA247152 Hs.200483 ESTs, WeaWy similar to K1AA1074 protein 3.1 
15 431574 AW572659 Hs.261373 hypothetical protein dJ434014.3 3.1 
436876 AI124756 Hs.5337 isocitrate dehydrogenase 2 (NADP).mitoc 3.1 
405017 NA Target Exon 3.1 

433805 AA706910 Hs.112742 ESTs *] 
437352 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 3.1 
20 430105 X70297 Hs.2540 cholinergic receptor, nicotinic alpha p 3.1 
422083 NM 001141Hs.11 1256 arachidonate lipoxygenase, second typ 3.1 
413507 BE145360 Hs.190064 ESTs, Weakly similar to 138022 hypolheti 3.1 
415989 A1267700 Hs.317584 ESTs 3.1 
422907 AI879263 Hs.6986 Human glucose transporter pseudogene 3.1 
25 425548 AA890023 Hs.1906 prolactin receptor 3.1 
422599 BE387202 Hs.1 18638 non-metastatic cells 1, protein (NM23A) 3.1 
439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.1 
453883 A1638516 Hs.22630 cofactor required for Sp1 transcriptiona 3.1 
458021 A1885190 Hs.155089 ESTs, Weakly similar to repressor protei 3.1 
30 418478 U38945 Hs.1174 cyclirwiependent kinase inhibitor 2A (me 3.1 
400814 NA Target Exon 3.1 

402327 Target Exon 3.1 

416935 M190712 gb:zp87f09.ri Straiagene HeLa ceu s3 93 • 

439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1 
35 437036 AI571514 Hs.133022 ESTs 3.1 
449523 NM_000579Hs,54443 chemokine(OC motif) receptor 5 3.1 
408642 AJ245210 gb:Homo sapiens mRNA for Immunoglobulin 3.1 

406624 AF052762 gb:Homo sapiens clone csneg8-1 immunogta 3.1 

421924 BE514514 Hs.109606 coronin, actin-binding protein, 1A 3.1 
40 414523 AU076633 Hs.76353 serine (or cysteine) proteinase inhibito 3.1 
416379 N38857 Hs.203933 ESTs 3.1 
422823 D89974 Hs.121102 vanln2 3.1 
433904 AI399956 Hs.208956 ESTs 3. 
421904 BE143533 Hs.109309 hypothetical protein FU20035 3.1 
45 428834 AW899713 Hs.339315 ESTs 3.1 
436043 AW963838 Hs.168830 Homo sapiens cDNA FU12136 fis, done MA 3.1 
452823 AB012124 Hs.30696 transcription factor-like 5 (basic helix 3.1 
405381 NA Target Exon 3.1 

428746 AW503820 Hs.192861 Spi-B transcription factor (SpM/PU.1 r 3.1 
50 435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f3.1 
425782 U66468 Hs.159525 cell growth regulatory with EF-handdoma 3.1 
423306 W88562 Hs.108198 ESTs 3.1 
419123 AA234276 Hs.88253 ESTs 3.1 
438581 AW977766 Hs.292133 ESTs, Moderately similar to I78885 senn 3.1 
5 5 417105 X60992 Hs.81226 CD6 antigen 3.0 
428361 NM 01 5905 Hs.1 83858 transcriptional intermediary factor 1 3.0 
417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 3.0 
402606 NMju24626:Homo sapiens hypothetical prot 3.0 

40^45-} NM_004496*:Homo sapiens hepatocyte nude 3.0 

60 421878 AA299652 Hs.1 11496 Homo sapiens cDNA FU1 1643 fis, done HE 3.0 
409518 BE384836 Hs.3454 KIAA1 821 protein 3.0 
416933 BE561850 Hs.80506 small nudear ribonudeoprotem polypept 3.0 
414324 Y14768 Hs.890 tymphotoxin beta (TNF superfamily. membe 3.0 
425081 X74794 Hs.154443 minichromosome maintenance deficient (S 3.0 
65 401519 c15000476*:gi|12737279|reflXPJ)12163.1| 3.0 

411704 A1499220 Hs.71573 hypothetical protein FU10074 3.0 
428819 AL135623 Hs.193914 WAA0575 gene product 3.0 
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428423 AU076517 Hs.184276 solute carrier family 9 (sodium/hydrogen 3.0 

413835 AI272727 Hs.249 163 fatty add hydroxylase 3.0 

412600 L28824 Hs.74101 spleen tyrosine kinase 3.0 

410491 AA465131 Hs.64001 Homo sapiens clone 25218 mRNA sequence 3.0 

5 433658 L03678 Hs. 1561 10 immunoglobulin kappa constant 3.0 

427666 AI791495 Hs.180142 calnwdutin-like sWn protein 3.0 

452514 AI904898 gbflC-BT068-1 30399-085 BT068 Homo sapien3.0 

429500 X78565 Hs.289114 hexabrachion (tenascin C ( cytotactin) 3.0 

432485 N90866 Hs^76770 COW52 antigen (CA^PATH-1 antigen) 3.0 

10 437400 AB011542 Hs.5599 EGF-like-domain, multiple 5 3.0 

452234 AW084176 Hs.223296 ESTs. Weakly similar to I38022 hypothetl 3.0 

413269 BE167526 gb:CM4-HT0509-080300-107-g07 HT0509 Homo 3.0 

453216 AL137566 Hs.32405 Homo sapiens mRNA; cONA DKFZp586G0321 (f 3.0 
400929 ENSP000G0252232*:Sterol regulatory eteme 3.0 

15 445145 AI961702 Hs,147434 ESTs 3.0 

432615 AA557191 Hs.55028 ESTs, Weakly similar to 154374 gene NF2 3.0 

423279 AW959861 Hs.290943 ESTs 3.0 

429392 AL109712 Hs^96506 Homo sapiens mRNA fufl length insert cON 3.0 

408548 AA055449 Hs.63187 ESTs, Weakly stmflar to ALUCJHUMAN IHl 3.0 

20 451346 NM 006338 Hs.263 12 glioma amplified on chromosome 1 protein 3.0 

413109 AW389845 Hs.1 10855 ESTs 3.0 

401714 NA ENSP00000241802*:CDNAFU11007FIS f CLON 3.0 

421462 AF016495 Hs.104624 aquaporin9 3.0 

421750 AK000768 Hs.107872 hypothetical protein FU20761 3.0 

25 453293 AA382267 Hs.10653 ESTs 3.0 

457085 AA412446 Hs.98138 ESTs 3.0 

438930 AW843633 Hs.306163 hypothetical protein All 101 1 5 3.0 
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TABLE 22A 

Table 22A shows the accession numbers for those pkeys lacking unigenelD's for Table 22. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbere 

Pkey CAT number Accessions 

411743 1256098 1 AW862214 AW85981 1 AWB62215 

412138 1279172~1 AW895387 AW895547 AW895564 AW895323 AW895405 AW895539 AW895538 

413269 135696f 1 BE167526 BE167651 BE076401 R24654 

416935 163179 1 AA190712 AA190665 AA252564 

422128 211994~1 AW881145 AA490718 M85637AA304575T06067 AA331991 

423945 233566 1 AA410943AW948953AA334202AA332882 

424109 235506^1 AW406878AW966560 AW966151 AW966496AA336174AA335376AA335537 

424128 235728*1 AW966163 AA335983AA336011 AA335668 AA335973 

425331 250199 1 AW962128 AA355353 AA427363 

426878 273265 J BE069341 AW748403 AL044591 A1308240 AA393G80 

432745 353673 1 AIB21926 AA658826 AA564492 AA635129 AI791191 

441153 51084 2 BE562826 BE378727 

448212 755099 1 AM75858 AW969013 

451128 859865J AL1 18668 D78823 AI762176 

452514 920172 1 AI904898 A1904849 AI904899 

456207 165078.-1 AA193450 
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TABLE 22B 



Table 22B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 22. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham I. et a)., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which axons were predicted. 

NLposition: indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400814 


8569925 


.Minus 


72840-72924,74761-74849 


400929 


7651921 


Minus 


122033-122241,123483-124028 


401045 


8117619 


Pius 


90044-90184,91111-91345 


401451 


6634068 


Minus 


119926-121272 


401519 


6649315 


Pius 


157315-157950 


401645 


7657839 


Minus 


34986-35133 


401714 


6715702 


Plus 


96484-96681 


401866 


8018106 


Pius 


73126-73623 


402327 


7656695 


Minus 


108675-108770,109801-109910 


402359 


9211204 


Minus 


4040341961 


402408 


9796239 


Minus 


110326-110491 


402470 


9797107 


Plus 


195129-195776 


402542 


9801558 


Minus 


67076-67594 ■ 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Minus 


81747-82094 


403011 


6693597 


Minus 


3468-3623 


403212 


7630897 


Minus 


156037-156210 


403329 


8516120 


Plus 


96450-96598 


403366 


8783692 


Minus 


4932349652 


403485 


9966528 


Plus 


2888-3001,3198-3532,36554117 


404347 


9838195 


Plus 


74493-74829 


404580 


6539738 


Minus 


240588-241589 


404755 


7706327 


Minus 


53729-53846 


405017 


6532084 


Plus 


35551-35690 


405348 


2914717 


Minus 


4331043462 


405381 


6006920 


Minus 


7636-8054 


405801 


2924321 


Plus 


63469-63694 


405850 


6164995 


Plus 


13871-14110 


406153 


9929734 


Minus 


12902-13069 


406348 


9255985 


Minus 


71754-71944 
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TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 



10 



Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 22, except that the numerator was set to the median value 
for 12 non-malignant breast specimens, the denominator was set to the median value amongst 
the 73 breast cancers, the 90 th percentile value amongst the 12 non-malignant breast 
specimens was greater than or equal 80 units, and the ratio was greater than or equal to 4.0 
(i.e. 4-fold down-regulated in tumor vs. normal breast). 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

1 5 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 



R1: 



Ratio of 50* percentile normal body tissue to 75* percentile tumor 



20 Pkey ExAccn Unigene© UnigeneTitJe Ratio 

428722 U76456 Hs.190787 tissue Inhibitor of metalloproteinase 4 2Z4 

428848 NM 000230 Hs.194236 teptin (murine obesity homolog) 17.4 

445263 H57646 Hs.42586 KJAA1 560 protein 15.4 

25 418935 T28499 Hs.89485 carbonic anhydrase IV 15.0 

407228 M25079 Hs.155376 hemoglobin, beta 14.6 

417511 AL049176 Hs.82223 chordirvtike 14.6 

439285 AL133916 Hs.172572 hypothetical protein FU20G93 14.3 

412442 AI983730 Hs.26530 serum deprivation response (phosphatidyl 13.6 

30 410544 AI446543 Hs.95511 ESTs 12.6 

412047 AA934589 Hs.49696 ESTs 12.2 

422667 H25642 Hs.133471 ESTs 12.0 

406664 L34041 Hs.9739 glycerol-3-phosphate dehydrogenase 1 (so 12.0 

423201 NMJ500163 Hs.125180 growth hormone receptor 11.7 

35 422163 AF027208 Hs.112360 prominin (mouselike 1 10.8 

428769 AW207175 Hs.106771 ESTs 10.6 

407049 X72632 NWL021724*:Homo sapiens nuclear receptor 10.1 

412295 AW088826 Hs.1 17176 poly(A)-binding protein, nuclear 1 9.8 

425126 N32759 Hs.172944 chorionic gonadotropin, beta polypeptide 9.8 

40 406791 A1220684 Hs.272572 hemoglobin, alpha 2 9.5 

447471 AF039843 Hs.18676 sprouty (Drosophila) homolog 2 9.5 

451533 NM.004657 Hs.26530 serum deprivation response (phosphatidyl 9.4 

419407 AW410377 Hs.41502 hypothetical protein FU21 276 9.0 

411939 AI365585 Hs.146246 ESTs 9.0 

45 410532 T53088 Hs.155376 hemoglobin, beta 8-9 

425707 AF1 15402 Hs,11713 E74-like factor 5 (ets domain transcript 8,8 

416585 X54162 Hs.79386 leiomodin 1 (smooth muscle) 8.7 

443060 078874 Hs.8944 procollagen C-endopeptidase enhancer 2 8.6 

435265 AA779958 Hs.185932 ESTs 8-5 

50 422511 AU076442 Hs.117938 collagen, type XVII, alpha 1 8.4 

433138 AB029496 Hs.59729 semaphorin sem2 8.3 

402195 NM_004497*;Homo sapiens hepatocyte nude 8.1 

429350 AI754634 Hs.131987 ESTs 8.1 

445107 AI208121 Hs.147313 ESTs, Weakly similar to 138022 hypotheti 8.1 

55 406643 N77976 Hs.272572 hemoglobin, alpha 2 8.0 

410199 AW377424 Hs.205126 Homo sapiens cDNA: FU22667 fis, done H 8.0 

417225 AA815048 Hs.24078 hypothetical protein FU12649 7.8 

437569 AA760849 Hs.294052 ESTs 7.5 

436062 AKD00027 Hs.98633 ESTs 7.5 

60 425078 NMJQ02599 Hs.154437 phosphodiesterase 2A, cGMP-stimulated 7.5 

430327 AW973636 Hs.55931 ESTs 7.4 
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447577 AI393693 Hs.183297 

446039 AI150491 Hs.90756 

422060 R20893 Hs.325823 

424455 AA452006 Hs.333199 

424343 AW956360 Hs.474B 

442792 AI352340 Hs.131194 

406714 A1219304 Hs.283108 

407571 AI446183 Hs.9572 

429580 AA346839 Hs.209100 

453500 AI478427 Hs.43125 

422233 AB002058 Hs.1 13275 

420205 AA256395 Hs.88156 
404368 NA 

447261 NNL006691 Hs.17917 

417090 AA193282 Hs.85863 

410677 NM.003278 Hs.65424 

427283 AL119796 Hs.174185 
415011 AW963085 

412068 S72043 Hs.73133 

416253 BE250659 Hs.15463 

435885 AA701483 Hs.36341 
402779 NA 

418138 AA213626 Hs.136204 

439335 M742697 Hs.62492 

427019 AA001732 Hs.173233 
411478 BE143068 
452654 BE004783 

447359 NM.012093 Hs.18268 

414323 NMJJ14759 Hs.334688 

441266 H15968 Hs.293845 

417011 F08212 Hs.234898 
400089 NA 

433614 W07475 Hs.277101 

440439 N92818 Hs.64754 
454404 BE067414 

436704 AA062610 Hs.148050 
406563 NA 

433490 AW451023 Hs.65848 

419313 AA843387 Hs.87279 

409196 NM.001874 Hs.334873 
410882 AW809163 

453469 AB014533 Hs.33010 

441899 AI372588 Hs.8022 

426210 AA372052 Hs.334559 
413065 BE063555 

454192 AW876813 Hs.3343 

425187 AW014486 Hs.225Q9 

429757 AW452355 Hs.256037 

420202 AL036557 Hs,9591G 

416284 A1695473 Hs.298006 

428553 AA181641 Hs.184907 
404689 NA 

438887 R68857 Hs.265499 

406082 S47833 Hs.82927 

449748 H23963 Hs.32043 

431048 R50253 Hs.249129 
452205 C15819 

434040 AW444613 Hs.288809 

407744 AB020629 Hs,38095 

450606 AI668605 Hs.60380 

414629 AA345824 Hs.76688 
401665 

436107 T99079 Hs.191194 

444432 AI161428 Hs.75916 

434715 BE005346 Hs.1 16410 

447205 BE617015 Hs.11006 

408122 AW32652 Hs.42824 



DKFZP566F2124 protein 7.4 

ESTs 72 

ESTs. Moderately similar to ALUS HUMAN A 7.2 

ESTs 7.1 

adenylate cyclase activating polypeptide 7.1 

ESTs " 7.0 

hemoglobin, gamma G 6.9 

ESTs, HigWy similar to CYA5.HUMAN ADENY 6.8 

DKFZP434C171 protein 6.7 

esophageal cancer related gene 4 protein 6.7 

purinergic receptor P2X-0<e 1, orphan r 6.7 

ESTs 6.6 

ENSP00000241075 4 :TRRAP PROTEIN. 6.6 

extracellular link domain-containing 1 6.5 

ESTs, Weakly slmflar to B34612 zinc fing 6.5 

tetranectin (piasminogen-blnding protein 6.5 

ectonudeotide pyrophosphatase/phosphodi 6.5 
gb:EST375158 MAGE resequences, MAGH Homo6.4 

metallothioneln 3 (growth inhibitory lac 6.4 

Homo sapiens, clone IMAGE2959994, mRNA 6.4 

ESTs 6.3 

Target Exon 6.3 

EST 6.3 

ESTs, Weakly similar to B39066 proline* 6.3 

hypothetical protein FU10970 6.2 
gb:MR0-HT015843020(HXXM)G9 HT0158 Homo6.2 
gb:MR2-BNQ114-270400-004-e11 BN0114 Homo6.1 

adenylate kinase 5 6.1 

K1AA0273 gene product 6. 1 

Homo sapiens, clone IMAGE3502329, mRNA, 6.1 

ESTs, WeaWyslmiiarto2109260ABcell 6.0 

Eos Control 6.0 

cytochrome c oxidase subunit IV isoform 5.9 

ESTs, Weakly simitar to potential COS [H 5.9 
gb:MR4-BT0355-200100-201-e05 BT0355 Homo5.9 

EST 5.9 

Target Exon 5.9 

hypothetical protein DKFZp7610132 5.9 

ESTs 5.9 

carboxypeptidase M 5.8 
gb:MR4-ST01 18-261099-012^03 ST0118 Homo5.8 

KIAA0633 protein 5.8 

TU3A protein 5.8 

Homo sapiens cONA FU14458 fis, clone HE 5.8 
gb:CM1-8T0283481 199-033^09 BT0283Homo5.8 

phosphoglycerate dehydrogenase 5.7 

ESTs 5.7 

ESTs 5.7 

putative lymphocyte G0/G 1 switch gene 5.7 

ESTs 5.7 

G protein-coupled receptor 1 5.6 

Target Exon 5.6 

ESTs 5.6 

adenosine monophosphate deaminase 2 (iso 5.6 

ESTs 5.6 

cell death-inducing DFFA-tike effector a 5.5 

gb:C15819 Clontech human aorta poly A mRN 5.5 

hypothetical protein FU20159 5.5 

ATP-binding cassette, sub-family A (ABC1 5.5 

ESTs, Moderately similar to ALU6JHUMAN A 5.5 

carboxylesterase 1 (monocyte/macrophage 5.5 

C11000703:gil10048448|reqNP_065258.1)g 5.5 

ESTs 5.5 

splicing factor 3b, subunit 2, 145kO 5.5 

ESTs 5.5 

ESTs, ModeraterysIm3artoT17372 plasm 5.5 

hypothetical protein FU10718 5.5 
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454016 AW016806 Hs.233108 ESTs 5.5 

414913 R25621 gb:yh45f06.r1 Soares placenta Nb2HP Homo 5.4 

459033 AA017590 Hs.129907 ESTs 5.4 

441003 BE172240 Hs.126379 ESTs, Weakly similar to I38022 hypolhetj 5.4 

450637 N49826 Hs.18602 ESTs 5.4 

442398 AA994520 gb:ou42g09.s1 $oares_NFL_T_GB(LS1 Homo s5.4 

403612 NA Target Exon 5.3 

407102 AA007629 Hs.9739 glyceroW-phosphate dehydrogenase 1 (so 5.3 

410057 R66634 Hs.268107 multimerin 5.3 

428232 BE272452 Hs.183109 monoamine oxidase A 5,3 

432769 AA620814 Hs.144959 ESTs 5.3 

431344 R99530 Hs.272572 hemoglobin, alpha 2 5.3 

427032 AF012023 Hs.173274 integrin cytoplasmic domain-associated p 5.3 

406305 BE261320 Hs.158196 transcriptional adaptor 3 (ADA3, yeast h 5.3 

437411 AW613948 Hs.194915 ESTs 5.3 

442800 AI809481 Hs.131227 ESTs 5.3 

402054 NA Target Exon 5.3 

432035 AF212829 Hs.272406 potassium channel, subfamily K, member 9 5.3 

415313 R59638 Hs.6181 ESTs 5.2 

459159 AI904646 gb:QV-BT065-020399-103 BT065 Homo sapien 5.2 

427164 AB037721 Hs.173871 KIAA1300 protein 5.2 

441391 BE467930 Hs.170381 ESTs 5.2 

458959 AI285901 Hs.181297 ESTs 5.2 

402698 NA ENSPOOQ00251335*;DJ1003J2.1 (sodium and 5.2 

401810 NA Target Exon 5.2 

438879 AA827674 Hs.189073 ESTs 5.2 

414657 AA424074 Hs.76780 protein phosphatase 1, regulatory (irihib 5.2 

427809 M26380 Hs. 180878 lipoprotein lipase 5,1 

456063 NM_0O6744 Hs.76461 retinol-binding protein 4, interstitial 5.1 

451186 AW023469 Hs.65256 ESTs, Weakly similar to leucine-rich gli 5.1 

451882 AI821324 Hs.100445 ESTs 5.1 

402583 NA NM_021620:Homo sapiens PR domain contain -5.1 

431130 NM_006103 Hs.2719 HE4; WFDC2; putative ovarian carcinoma m 5.1 

458218 AI435179 Hs.126820 ESTs 5.1 

416083 R53467 Hs.269122 ESTs, Weakly similar to ALU U^MAN ALU S 5.1 

455282 BE143867 gb:MRO-HT0164-070100-013-h02 HT0164 Homo5.1 

426488 X03350 Hs.4 alcohol dehydrogenase 18 (class I), beta 5.1 

426156 BE244537 Hs.167382 natriuretic peptide receptor A/guanylate 5,1 

407891 AA486620 Hs.41135 endomucin-2 5.0 

408610 AW026692 Hs.224829 ESTs 5.0 

445967 D59597 Hs.118821 CGI-62 protein 5.0 

434813 AI524307 Hs.162870 ESTs 5.0 

437526 AI076012 Hs.121388 ESTs, Weakly similar to MDHCJiUMAN MALAT 5.0 

454775 BE160229 gb:QV1-HT0413490200-062-a12HT0413 Homo 5.0 

409451 AF012626 Hs.54472 fragile X mental retardation 2 5,0 

409853 AW502327 gb:Ul-HF-BR0p*ka-a-07-O-Ul.r1 NIH_MGC_5 5.0 

405062 Target Exon 5.0 

446490 AK000706 Hs.15125 hypothetical protein FU20699 5.0 

417622 AW298163 Hs.82318 WAS protein family, member 3 5.0 

421978 AJ243662 Hs.110196 NICE-1 protein 5.0 

440338 R62431 Hs.12758 ESTs 5.0 

415421 R35009 Hs.24903 ESTs 5.0 

417574 R00348 gb:ye69e06.r1 Soares fetal liver spleen 5.0 

409882 AJ243191 Hs.56874 heatshock 27kD protein fa mBy, member 7 5.0 

447998 AI768289 Hs.304389 ESTs 4.9 

445613 BE550889 Hs.158491 ESTs 4.9 

443074 AW341470 Hs.144907 ESTs 4,9 

451324 AI783600 Hs.208052 ESTs 4.9 

432433 AW014734 Hs.157969 ESTs 4.9 

449654 A1989812 Hs.199850 ESTs 4.9 

414519 N94587 - Hs.55063 ESTs 4.9 

457531 AW973716 Hs.13913 KIAA1 577 protein 4.9 

433200 AA682722 Hs.192725 ESTs 4.9 

430782 AF026263 Hs.247920 choBnergic receptor, muscarinic 5 4.8 

427555 AW137094 Hs.97990 ESTs 4.8 

433545 AA868510 Hs.1 12496 ESTs 4.8 

420334 AI349351 Hs.1 18944 hypothetical protein FLJ22477 4.8 
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65 



421795 


X63094 


Hs.283822 


427138 


N77624 


Hs.173717 


413072 


BE063965 




443721 


AW450451 


Hs.266355 


408053 


AW139474 


Hs.246862 


427067 


M843716 


Hs.177927 


442969 


AI025499 


Hs.132238 


426220 


AI383475 


Hs.171697 


414593 


BE386764 




426893 


AA398716 


Hs.97418 


434046 


AW292618 


Hs.113011 


401590 


NA 




457971 


AW134679 


Hs.242849 


427722 


AKO 00123 


Hs.180479 


443793 


AA045290 


Hs.25930 


407737 


R49187 


Hs.6659 


441955 


AA972327 


Hs.142903 


441499 


AW298235 


Hs.101669 


447517 


AI382726 


Hs.182434 


403017 






450580 


N40037 


Hs.15248 


404611 


H58589 


Hs.35156 


414631 


M31158 


Hs.77439 


459290 


NMJ01546 


Hs.34853 


444341 


AI142027 


Hs.146650 


408614 


AL1 37698 


Hs.46531 


449638 


AW204277 


Hs.250723 


434418 


AF134707 


Hs.278679 


447360 


AI375984 


Hs.167216 


419583 


F00312 




440698 


AI346455 


Hs.147492 


451199 


AJ290653 


Hs.124758 


438338 


NMJ)14861 


Hs.6168 


433756 


AW015933 


Hs.1 12654 


423301 


S67580 


Hs.1645 


417237 


H86335 


Hs.81737 


439745 


AL369981 


Ks.149219 


424137 


AA335769 


Hs.16262 


449338 


H73444 


Hs.394 


434744 


N94835 


Hs283S2B 


407402 


AF035303 




443510 


NM.012190 


Hs.9520 


415754 


AA169114 


Hs.12247 


415986 


Z43619 




457416 


BE142052 


Hs.62654 


418064 


BE387287 


Hs.83384 


437120 


AI356125 


Hs.157767 


453950 


AA156993 


Hs.211568 


401093 






436935 


AW206494 


Hs.253560 


457974 


AW842353 


Hs.321717 


428222 


AL133112 


Hs.183085 


442705 


A1264634 


Hs.131127 


437409 


AL359599 


Hs.283850 


458494 


AI380906 


Hs.158436 


410490 


H03589 




416069 


R37101 


Hs.20982 


438463 


AA807958 


Hs.314232 


444043 


A1499723 


Hs.135089 


438327 


H87407 


Hs.172944 


457711 


AF147401 


Hs.23917 


400870 






441425 


AA933590 


Hs.28937 


416267 


H45384 




407262 


M12873 




444567 


AV654020 


Hs.184261 


403263 


NA 





Rhesus blood group, D antgen 4.8 

phosphatidic acid phosphatase type 2B 4.8 
gb:QV3^0296-14Q200485-h01 BT0296 Homo 4.8 

ESTs 4.8 

ESTs 4.8 

ESTs 4.7 

ESTs 4.7 

ESTs, Weakly similar to T13924 sdk prote 4.7 
gb:601273249F1 NIH_MGC_20 Homo sapiens c 4.7 

ESTs 4.7 

ESTs 4.7 

Target Exon 4.7 

ESTs 4.7 

hypothetical protein FU201 16 4,6 

ESTs, Weakly similar to 2109260A B cell 4.6 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

Target Exon 4.6 

ESTs 4.6 

Homo sapiens cDNA FU 11027 fis. done PL 4,6 

protein \ana$e t cAMP-dependent, regulato 4.6 

inhibitor of DNA binding 4, dominant neg 4.6 

ESTs 4.6 
Homo sapiens mRNA; cONA DKF2p434C1915 (f 4.6 

hypothetical protein MGC2747 4.6 

a disintegrin and metalloproteinase doma 4.6 

ESTs 4.6 

gb:HSBB0D101 STRATAGENE Human skeletal m4.6 

Homo sapiens cDNA FU1 1777 fis, done HE 4.6 

ESTs 4.6 

KIAA0703 gene product 4.6 

Homo sapiens, done MGCr9764, mRNA, comp 4.5 

cytochrome P450, subfamily IVA, polypept 4.5 

palmttoyt-protein thioesterase 2 4.5 

Homo sapiens mRNA full length insert cON 4.5 

ESTs 4.5 

adrenomedullin 4.5 

Homo sapiens genomic DNA, chromosome 21q 4.5 

gb:Homo sapiens clone 23943 mRNA sequenc 4.5 

formyltetrahydrofolate dehydrogenase 4.5 

hypothetical protein FU1 1413 4.5 

gb:HSC1GE121 normalized infant brain cDN 4,5 

kringte-containing transmembrane protein 4.5 

S10O caldum-binding protein, beta (neur 4.4 
ESTs. Weakly similar to HXA2_HUMAN HOMEO 4.4 

eukaryotic translation initiation factor 4.4 

C12000586*gi|6330167|db]IBAA86477.1| (A 4.4 

ESTs 4.4 

ESTs, Weakly similar to S22765 heterogen 4.4 

Homo sapiens mRNA; cDNA DKFZp434K098 (fr 4.4 

ESTs 4.4 
Homo sapiens mRNA; cDNA DKFZp547C126 {fr 4.4 

ESTs 4.4 

gb:yj42d08.r1 Soares placenta Nb2HP Homo 4.4 

ESTs 4.4 

ESTs 4.4 

ESTs 4.4 

chorionic gonadotropin, beta polypeptide 4.4 

ESTs 4.3 

C1 1000905:gi|11692565lgblAAG39879.1lAF28 4.3 

homeobox protein from AL590526 4.3 

gb:yn99c10.r1 Soares adult brain N2b5HB5 4.3 

gbiHuman (g rearranged H-chain mRNA VDJ4 4.3 

ESTs. Weakly similar to T26686 hypotheti 4.3 

Target Exon 4.3 
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410034 BE067414 

456804 AI421645 Hs.139851 

446427 BE395260 Hs.309438 

416931 D45371 Hs.80485 

421296 NMJ02666 Hs.103253 
400973 NA 

452602 AW366194 Hs.55962 

412330 NWJD05100 Hs.788 
405016 

435104 A1475671 Hs.88607 
406118 

418556 T02850 

429745 AA480818 Hs.221736 

433088 AW451206 Hs.1 15899 

444445 AA342329 Hs.115920 

453880 AI803166 Hs.28462 

447384 AI377221 Hs.40528 

414541 BE293116 Hs.76392 

444975 AV652165 Hs.182482 
403921 NA 

451477 AI798425 Hs.42710 
406344 

416970 AA191201 Hs.35861 

413662 BH155866 Hs.25522 

458504 AW070634 Hs.144794 
404682 NA 

418089 N69913 Hs.6858 
403433 

446532 AW975460 Hs.143563 

414217 AI309298 Hs.279898 

418425 AI871247 Hs.6262 

419589 AW973708 Hs.201925 

457029 AA397789 Hs.161803 

447860 AF193807 Hs.131835 

448988 Y09763 Hs.22785 

440610 AI733098 Hs.13080Q 
439590 AF086410 

427240 AA399975 Hs.274151 

408932 AW594172 Hs.278513 

436112 T77545 Hs.187559 

444382 AI144152 Hs.58246 

456716 AA318060 Hs.135121 

419846 NM-015977 Hs.285681 

410036 R57171 Hs.57975 
400545 NA 
403051 NA 

420139 NM_005357 Hs.95351 

450244 AA007534 Hs.125062 

453261 AA034116 Hs.1 18494 

440246 W52010 Hs.191379 

414516 AI307802 Hs.135560 

438232 AI150595 Hs.122226 

410233 AA082947 

412179 BE270758 Hs.69428 

441871 A/306150 Hs.153450 

426411 AK000708 Hs.169764 

453692 AL110416 

448640 AW817177 Hs.102558 

417481 AA203281 Hs.21798 

412912 AW118878 Hs.1 10835 

454183 AW807116 

426328 AW631296 

435942 R06285 Hs.191215 

417629 T76945 

403593 NA 

402690 

418190 R49591 Hs.270425 



gb:MR4-BTO355-2OO100-2O1-e05 BT0355 Homo4.3 



caveofin 2 4.3 

EST 4.3 

adipose most abundant gene transcript 1 4.3 

periiipin 4.3 

ENSP00000236667*:Mucin 5B (Fragment). 4.3 

ESTs 4.3 

A kinase (PRKA) anchor protein (gravEn) 4.3 

CY00017r.-gl|92804O5(gblAAF86402.1|AF245 4.3 

ESTs, Highly similar to F-box protein FB 4.3 

ENSP00000246632:CONA FU2Q261 fis, clone 4.3 

gb:FB1 2A9 Fetal brain. Stratagene Homo s 4.3 

ESTs 4.3 

ESTs 4.3 

Homo sapiens cDNA: FU22816 lis, clone K 4.3 

ESTs, Weakly similar to 138022 hypothetl 4.3 

ESTs 4.2 

aldehyde dehydrogenase 1 family, member 4.2 

ESTs, Weakly similar to T00362 hypotheti 4.2 

C500021 2*:gi| 1 0O47237|dfaj|BAB1 3407.1 j (A 4.2 

ESTs 4.2 

C5001660:gi|1 1611 537 jd bjJSAB 1 8935. 1 1 (AB A2 

OKFZP586E1621 protein 4.2 

KJAA1808 protein 42 

ESTs A2 

C9001 188*:gil12738842lrefjNPJ73725.1I p 4.2 

ESTs. Weakly similar to 178885 serine/lh 4.2 

NM_001622:Homo sapiens alpha-2-HS-glycop 4.2 

ESTs 4.2 

Homo sapiens cDNA: FU23165 lis, clone L 42 

hypothetical protein MGCB407 4.2 

Homo sapiens cDNA FU13446 fis, clone PL 4.2 

ESTs 42 

Rhesus blood group, B glycoprotein 4.2 

gamma^rninobutyric acid (GABA) A recepto 4.2 

ESTs 4.2 

gb:Homo sapiens fall length h^seit cONA 4.2 

figatin 4.2 

TP53TG3 protein 4.2 

ESTs 42 

ESTs 42 

hypothetical protein FU22415 4.2 

Williarns-Beuren syndrome chromosome regi 42 

calsequestrin 2 (cardiac muscle) 4.1 

Target Exon 4.1 

Target Exon 4.1 

lipase, hormone-sensitive 4.1 

ESTs 4.1 

ESTs 4.1 

ESTs 4.1 

ESTs, Weakly similar to T43458 hypotheti 4.1 

ESTs 4.1 

gb:zn10g07.s1 Stratagene hNT neuron (937 4.1 

hypothetical protein MGC3020 4.1 

ESTs.WeaWysimiiarto1909123ANagIuc 4.1 

hypothetical protein FU20701 4.1 

gb:DKF2p434K0431_r1 434 {synonym: htes3) 4.1 

Homo sapiens, done MGC:5352, mRNA, comp 4.1 

ESTs 4.1 

ESTs 4.1 
gb:MR4-ST0062-04010(W)24-b12ST0062Homo4.1 

gb:hh83c09.y1 NCLCGAPJ3U1 Homo sapiens 4.1 

ESTs 4.1 

gb:yc92c07.r1 Soares Infantbram 1NIB H 4.1 

Target Exon 4.0 

Target Exon 4.0 

ESTs 4.0 
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408641 AW245207 Hs.5555 hypothetical protein MGC5347 4.0 

427899 AA829286 Hs.332053 serum amyloid A1 4.0 

445975 A1811536 Hs.145734 ESTs 4.0 

438831 BE263273 Hs.6439 synapsin II 4.0 

455578 BE005350 Hs.14355 Homo sapiens cDNA FU13207 fis, done NT 4.0 

401840 NA Target Exon 4.0 

413753 U17760 Hs.75517 laminln, beta 3 (ntceln (125kD), kaDnin 4.0 

445030 A1205925 Hs.147238 ESTs, Highly similar to AAC3_HUMAN ALPHA 4.0 

433873 AW156913 Hs.150478 ESTs, Weakly simfar to A Chain A. Cryst 4.0 

456736 AW248217 Hs.1619 achaete-scute complex (Drosophila) homol 4.0 

450112 BE047734 Hs.5473 ESTs, Moderately similar to ALU5_HUMAN A 4.0 

448906 A1589567 Hs.309719 ESTs 4.0 
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TABLE 23A 



Table 23A shows the accession numbers for those pkeys lacking unigenelD's for Table 23. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



409853 1 156226 1 AW502327 AW502488 AW501829 AW502625 AW502687 

20 410034 1170594 J BE067414 BE067958 BE067419 BE067953 AW577127AW601412 

410233 118656J AA082947 AAQ83036 

410490 1205347 1 H03589 AW750687 AW750688 

410882 1225686 1 AW809163 AW809247 AW809177 AW809190 AW809225 

411478 1247073*1 BE143068 AW849143 AW848705 AW848569 AW848071 AW848475 AW848092 AW848C05 

25 413065 1347960~1 BE063555 BE1 51321 BE151319 BE151657 BE151655 BE063556 BE1 51322 

413072 1348163~1 BE063965 BE063968 BE064034 BE064028 BE063874 BE063966 BE063869 BE064043 BE0 

414593 1464909*1 BE386764 BE387560 

414913 1506721 1 R25621 C03959 C04010 

415011 151328 1 AW963085AA159005AW963073 

30 415986 1564410 1 Z43619 R61274 H12206 R12883 

416267 1583547.1 H45384 H49 125 H4 1699 

417574 1687770 1 R00348R09593 

417629 1690392J T76945 R20210 R05755 

418556 1767866_-1 T02850 

3 5 419583 186198 1 F00312 AA247490 F31427 AA383663 F22045 

426328 264901 1 AW631296AA375484 

439590 47413J AF086410 W94386 W74609 

442398 541271 1 AA994520AW393574 

452205 90415 T C15819 AA024741 AA024742 

40 452654 925931 1 BE0O4783 BE004947 AI911790 



10 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



15 



Pkey CAT number Accessions 



45 



50 



453692 977825J 
454183 1049636J 




55 



454404 1170594J 

454775 1234106.1 

455282 1273020J 

459159 919998J 



BE141629 BE141625 BE141624 BE141636 BE141630 AW807405 AW807290 AW807353 
BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 

BE160229 AW819879 AW820179 AW819882 AW819876 AW820169 BE153201 AW993736 BE15291 1 
BE143867 AW935060 AW886684 
A1904646 BE179494 BE179421 
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TABLE 23B 

Table 23B shows the genomic positioning for those pkeys lacking unigene ED's and 
5 accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbeis in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al.' refers to the publication 

entitled "The ONA sequence of human chromosome 72' Dunham I. et al., Nature (1999) 402:469-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand Nt_posttion 


400545 


9800107 


Minus 


124618-124881 


400870 


9838306 


Minus 


34081-35027 


400973 


7960452 


Minus 


98119-98253 


401093 


8516137 


Minus 


22335-23166 


401590 


9966320 


Minus 


33547-33649 


401665 


7145001 


Plus 


121591-122537 


401810 


7342191 


Plus 


129063-129476 


401840 


7684597 


Plus 


56283^56439 


402054 


8083691 


Minus 


82884806 


402195 


7689778 


Minus 


147901-148884 


402583 


7684486 


Pius 


94883-95003 


402690 


8348058 


Plus 


13368-13998 


402698 


8570304 


Minus 


108641-108903 


402779 


9588555 


Minus 


38173-39210 


403017 


6693623 


Plus 


78630-79367 


403051 


4827080 


Minus 


5269-5411 


403263 


7770677 


Plus 


52431-52737 


403433 


9719611 


Minus 


72225-72437 


403593 


6862650 


Minus 


62554-62712,69449-69602 


403612 


8469060 


Minus 


94723-94859 


403921 


7711590 


Minus 


3297-3536 


404368 


7630956 


Minus 


102053-102199 


404682 


9797231 


Minus 


40977-41150 


404689 


7534100 


Plus 


119461-119717 


405016 


6524300 


Plus 


51997^53308 


405062 


7657730 


Plus 


101283-101432 


406118 


9143818 


Plus 


53997-54629 


406344 


9255974 


Plus 


20254-20374,20526-20659,20835-21097 


406563 


7711604 


Plus 


34401-34538 
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TABLE 24: 

Table 24 depicts Seq ID No., UnigeneiD, UnigeneTitle, Pkey, Pred.Cell.Loc., and ExAccn 
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No. to 
Table 25. 
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Pkey: 
ExAccn: 



Unique Eos prabeset Identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 



UnigeneTitle: 
Pred.CeII.Loc.: 
SeqJD.No.: 



Predicted Cellular localization 

Sequence Identification Number found in Table 25 



Pkey ExAccn UnigeneiD UnigeneTitle 



Pred.Cellloc. Seq. ID. No. 



449746 
407276 
415539 
400297 
450375 
102457 
429170 
424399 
422505 
449765 
425692 
426215 
439840 
410102 
429220 
416276 
409079 
442818 
442082 
444381 
446163 
416636 
442117 
433043 
429353 
452190 
446733 
452747 
423242 
417433 
432201 
423961 
439569 
114480 
404561 
325372 
112287 
335824 
424735 
400289 
427585 
429925 
429441 
421155 
420931 
420813 
452744 



AI668594 Hs.176588 
AI951118 HS.326736 
AI733881 Hs72472 
AI127076 Hs.334473 
AA009647 Hs.8850 
NM.001394Hs.2359 
NMJ01394HS.2359 
AI905687 Hs.2533 
AL120862 Hs.124165 
N92293 Hs.206832 
D90041 Hs.155956 
AW963419 Hs.155223 
AW449211 Hs.105445 
AW248508 Hs279727 
AW207206 Hs.136319 
U41060 Hs.79136 
W87707 Hs.82065 
AK001741 Hs.8739 
R41823 Hs.7413 
BE387335 Hs.283713 
AA026880 Hs.25252 
N32536 Hs.42645 
AW664964 Hs.128899 
W57554 Hs.125019 
AL1 17406 Hs.200102 
H26735 Hs.91668 
AA863360 Hs.26040 
BE153855 Hs.61460 
AL039402 Hs.125783 
BE270266 Hs.82128 
AI538613 Hs.298241 
D13666 Hs.136348 
AW602166 Hs.222399 
BE066778 Hs.151678 

NA 

AB033064 Hs.S34806 
NA 

U31875 Hs.272499 

X07820 Hs.2258 

D31152 Hs.179729 
NM_000786Hs^26213 

AJ224172 Hs.204096 

H87879 Hs.102267 

AF044197 Hs.100431 

X51501 Hs.99949 

A1267652 Hs.30504 



ESTs, Weakly similar to CP4Y.HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-R1B 

hypothetical protein DKFZp56401278 

a disintegrin and metaHoproteinase doma 

dual specificity phosphatase 4 nuclear 

dual specificity phosphatase 4 nuclear 

aldehyde dehydrogenase 9 family, member cytoplasm 

ESTs 

ESTs, Moderately similar to ALU8.HUMAN A 

N-acetyltransferase 1 {aryiamine N-acety 

stanniocaltin 2 

GDNF family receptor alpha 1 

Homo sapiens cONA FU14035 fis, done HE 

ESTs 

U\M protein, estrogen regulated 
interieukin 6 signal transducer (gp130, 
hypothetical protein FLJ 10879 
ESTs 

ESTs, Weakly similar to S64054 hypotheii 
Homo sapiens cDNA FU13603 fis, clone PL 
solute carrier family 16 (monocarboxylic 
ESTs 

lymphoid nuclear protein (LAF-4) mRNA 
ATP-binding cassette transporter MRP8 
Homo sapiens clone PP1498 unknown mRNA 
ESTs, Weakly similar to fatty acid omega 
Ig superfamBy receptor LNIR 
DEME-6 protein 

5T4 oncofetal trophobtastgtycopratein 
Transmembrane protease, serine 3 
osteoblast specific factor 2 (fascicTin 
CEGP1 protein 

UDP-N-acetyl-atpha-D-5alactosamine:poIyp 
NM_014112*:Homo sapiens trichorhinophala mitochodrla 
Phase2&3Exons nuclear 
K1AA1238 protein 

ENSP00000249072*:DJ222E13.1 (N-TERMINAL 
short-chain alcohol dehydrogenase family 
matrix metaUoprateinase 10 (stromelysin 
collagen, type X, alpha 1 {Schmid metaph 
cytochrome P450, 51 {lanosterol 14-alpha ER 
Bpophliin B (uteroglobin family member) 
lysyl oxidase v extracellular 
small inducible cytokine B subfamily (Cy 
prolactin-induced protein 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 



SeqlD1&2 
Seq ID3&4 
SeqlD5&6 
SeqlD7&8 
SeqlD9&10 
Seq ID 11 & 12 
Seq 1D1U12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19 & 20 
Seq ID 21 & 22 
Seq!D23&24 
Seq ID 25 & 26 
SeqlD27&28 
Seq ID 29 & 30 
Seq ID 3U 32 
Seq ID 33 & 34 
Seq ID 35 & 36 
SeqlD37&38 
SeqlD39&40 
SeqlD41&42 
SeqlD43&44 
Seq ID 45 & 46 
SeqlD47&48 
SeqlD49&50 
SeqlD51&52 
Seq ID53&54 
Seq ID 55 & 56 
SeqlD57&58 
Seq ID 59 & 60 
Seq ID 61 & 62 
Seq ID 63 & 64 
Seq ID 65 & 66 
Seq ID 67 & 68 
Seq ID 69 & 70 
Seq ID 71 & 72 
SeqlD73&74 
Seq ID 75 & 76 
Seq ID 77 & 78 
Seq ID 79 & 80 
Seq ID 81 & 82 
Seq ID 83 & 84 
Seq ID 85 & 86 
Seq ID 87 & 88 
Seq ID 89 & 90 
Seq ID 91 & 92 
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10 



15 



20 



420757 
424905 
429659 
446921 
445537 
428227 
' 424001 
421727 
452838 
419667 
414812 
426320 
422867 
411558 
417866 
428398 
431958 
428722 
412970 
421379 
415752 
444051 
451110 



X78592 Hs.99915 
NMJ02497HS.153704 
NM_007050Hs.225952 
AB012113 Hs.16530 
AJ245671 Hs.12844 
AA321649 Hs.2248 
W57883 Hs.137476 
Y13153 Hs.107318 
U65011 H&30743 
AU077005 Hs.92208 
X72755 Hs.77367 
W47595 Hs.169300 
L32137 Hs.1584 
AA102670 Hs.70725 
AW067903 Hs.82772 
AI249368 Hs.98558 
X63629 Hs.2877 
U76456 Hs.190787 
AB026436 Hs.177534 
Y15221 Hs.103982 
BE314524 Hs.78776 
N48373 Hs.10247 
AI955040 Hs.265398 
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TABLE 24 A 



Table 24A shows the accession numbers for those pkeys lacking unigenelD's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



1 0 Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 



15 Pkey CAT number Accession 

335824 CH22_3197FG 619J1_LINK_E 
325372 c12_hs 
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TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

1 0 Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Ntposition: Indicates nucleotide positions of predicted exons. 



15 



20 



Pkey Ref Strand HIjosition 
404561 9795980 Minus 69039-70100 
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Table 25 

The 69 gene sequences identified to be overexpresscd in breast cancer may be used to identify coding regions from die public DNA databases (nr and htgs in 
Genbank). The sequences may be used to either identify genes that encode known proteins, or they may be used to predict the coding regions from genomic 
DNA using exon prediction algoriUims, such as FGENESH (Salamov and Solovyev, 2000, Genome Res. 10:5 16-522) 

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGGAGCCCT CCTGGCTTCA GG AACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
TGCATGTCTC TGCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGGATGATC 120 
AGAGCCCTGC ACCTGTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 180 
TACCCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTGATGG AAAAATACCC ATGTGCTGTT 240 
CCCTTGTGGG TTGGACCCTT TACGATGTTC TTCAGTGTCC ATGACCCAGA CTATGCCAAG 300 
ATTCTCCTG A AAAGACAAGA TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TGAATCCTGG 360 
GTTGGTCGAG GACTTGTGAC CCTGGATGGT TCTAAATGG A AAAAGCACCG CCAG ATTGTG 420 
AAACCTGGCT TCAACATCAG CATTCTGAAA ATATTCATCA CCATGATGTC TG AGAGTGTT 480 
CGGATG ATGC TG AACAAATG GG AGG AACGC ATTGCCCAAA ACTCACGTCT GGAGCTCTTT 540 
CAACATGTCT CCCTGATGAC CCTGGACAGC ATCATGAAGT GTGCCTTCAG CCACCAGGGC 600 
AGCATCCAGT TGGACAGTAC CCTGGACTCA TACCTGAAAG CAGTGTTCAA CCTTAGCAAA 660 
ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAGAACTTC ATCAGTTCAC AGAGAAAGTA 780 
ATCCAGGACC GGAAGG AGTC TCTTAAGGAT AAGCTAAAAC AAG ATACTAC TCAGAAAAGG 840 
CGCTGGGATT TTCTGGACAT ACTTTTGAGT GCCAAAAGCG AAAACACCAA AGATTTCTCT 900 
GAAGCAGATC TCCAGGCTGA AGTGAAAACG TTCATGTTTG CAGGACATGA CACCACATCC 960 
AGTGCTATCT CCTGGATCCT TTACTGCTTG GCAAAGTACC CTGAGCATCA GCAGAGATGC 1020 
CGAGATGAAA TCAGGGAACT CCTAGGGGAT GGGTCTTCTA TTACCTGGGA ACACCTGAGC 1080 
CAGATGCCTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAGTA 11 40 
AACATATCCC GGTTACTCGA CAAACCCATC ACCTTTCCAG ATGGACGCTC CTTACCTGCA 1200 
GGAATAACTG TGTTTATCAA TATTTGGGCT CTTCACCACA ACCCCTATTT CTGGGAAGAC 1260 
CCTCAGGTCT TTAACCCCTT GAGATTCTCC AG GG AAA ATT CTG AAAAAAT ACATCCCTAT 1 320 
GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCA AGTCC A AOAATGGAAT CCATGTGTTT 1500 
GCAAAAAAAG TTTGCTAATT TTAAGTCCTT TCGTATAAGA ATTAATGAGA CAATTTTCCT 1560 
ACCAAAGGAA GAACAAAAGG ATAAATATAA TACAAAATAT ATGTATATGG TTGTTTGACA 1620 
AATTATATAA CTTAGGATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1680 
TCTTTGCTGT ATCTGGTGAA ACCCACAAAA ACACCTGAAA AAACTCAAGC TGACTTCCAC 1740 
TGCGAAGGGA AATTATTGGT TTGTGTAACT AGTGGTAGAG TGGCTTTCAA GCATAGTTTG 1800 
ATCAAAACTC CACTCAGTAT CTGCATTACT TTTATCTCTG CAAATATCTG CATG ATAGCT 1 860 
TTATTCTCAG TTATCTTTCC CCAATAATAA AAAA 



Scq ID NO: 2 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 II 21 31 41 51 

MEPSWLQELM AHPFLLLILL CMSLLLFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF 60 
YPVKEFEVYH KLMEKYPCAV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKILESW 120 
VGRGLVTLDG SKWKXHRQIV KPGFNISILIC IFITMMSESV RMMLNKWEER IAQNSRLELF 1 80 
QHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 
SQGQIFSKFN QELHQFTEKV IQDRKESLKD KLKQDTTQKR RWDFLDILLS AKSENTKDFS 300 
EADLQAEVKT FMFAGHDTTS SAISWILYCL AKYPEHQQRC RDEIRELLGD GSSITWEHLS 360 
QMPYTTMCQC ECLRLYAP W N1SRLLDKP1 TFPDGRSLPA GITVFINIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKIHPY AFIPFSAGLR NCIGQHFAII ECKVAVALTL LRFKLAPDHS 480 
RFPQPVRQVV UCSKNGIHVF AKKVC 

Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NM J)52997 

Coding sequence: 1 00-4 1 2 5 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CTAGTCTATA CCAGCA ACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAGATCCAT 60 
AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAG ATGACAAAGAG GAAGAAG ACC 120 
ATCAACCTTA ATATACAAGA CGCCCAGAAG AGGACTGCTC TACACTGGGC CTGTGTCAAT 180 
GGCCATGAGG AAGTAGTAAC ATTTCTGGTA GACAGAAAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCG AAC ACA GGACACCTCT GATG AAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTGATAG ATTCTGGTGC CG ATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
CATTATGCTG TTTATAGTG A GATTTTGTCA GTGGTGGCAA AACTGCTGTC CCATGGTGCA 420 
GTCATCG AAG TGCACAACAA GGCTAGCCTC ACACCACTTT TACTATCCAT AACG AAAAGA 480 
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AGTGAGCAAA TTGTGOAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 540 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TGTCATGGAT CATCAGAGAT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTGA CGTCTTTGCT GCAG ATATAT GTGGAGTAAC TGCAGAACAT 660 
TATGCTGTTA CTTGTGGATT TCATCACATT CATGAACAAA TTATGGAATA TATACGAAAA 720 
TTATCTAAAA ATCATCAAAA TACCAATCCA OAAGG AACAT CTGCAGGAAC ACCTGATGAG 780 
GCTGCACCCT TGGCGGAAAG AACACCTGAC ACAGCTGAAA GCTTGGTGGA AA AAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGAACAGTCA GCAGAAGAAA CACCTAGGGA AATTACGAGT 1020 
CCTGCAAAAG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC J 080 
GCATGGGAGA AAAAAGAAGA CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACATCT 1 140 
GAGAAATTTA CGTGGGCAGC AAAAGGAAGA CCTAGGAAGA TCGCATGGGA GAAAAAAGAA 1200 
ACACCTGTAA AGACTGGATG CGTGGCAAGA GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 
AAAGGAAGAT CTAAG ATG AT TGCATGTCCT ACAAAAGAAT CATCTACAAA AGCAAGTGCC 1 320 
AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1380 
TCTCGGAGTC TCTTTGAGAG TTCTGCAAAG ATTCAAGTGT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGOC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCAAATAAAG CCTTTGAATT GAAGAATGAA 1560 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 
AATTCTTGGG ATTCTGAGAG TCTCTGTGAG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1680 
AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATGGAA AATTAGAAGA GTCTCCTAAT 1740 
AAAGATGGTC TTCTGAAGGC TACCTGCGGA ATG AAAGTTT CTATTCCAAC TAAAGCCTTA 1 800 
GAATTGAAGG ACATGCAAAC TTTCAAAGCG GAGCCTCCGG GG AAGCCATC TGCCTTCGAG 1 860 
CCTGCCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTGAA AAATG AACAA 1920 
ACATGGAGAG C AG ATG AG AT ACTCCCATCA GAATCCAAAC AAAAGGACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAGAGTCT CTGTGAGACT GTTTCACAGA AGGATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAAAGAAAT AGATAAAATA AATGGAAAAT TAGAAGGGTC TCCTGTTAAA 2100 
GATGGTCTTC TGAAGGCTAA CTGCGGAATG AAAGTTTCTA TTCCAACTAA AGCCTTAGAA 2160 
TTGATGGACA TGCAAACTTT CAAAGCAGAG CCTCCCGAGA AGCCATCTGC CTTCGAGCCT 2220 
GCCATTGAAA TGCAAAAGTC TGTTCCAAAT AAAGCCTTGG AATTGAAGAA TGAACAAACA 2280 
TTGAGAGCAG ATGAGATACT CCCATCAGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAOAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA A AG A A AT AG A TAAAATAAAT GGAAAATTAG AAGAGTCTCC TGATAATGAT 2460 
GGTTTTCTGA AGGCTCCCTG CAG AATGAAA GTTTCTATTC CAACTAAAGC CTTAGAATTG 2520 
ATGGACATGC AAACTTTCAA AGCAG AGCCT CCCGAGAAGC CATCTGCCTT CGAGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCAAATAAA GCCTTGGAAT TGAAGAATGA ACAAACATTG 2640 
AGAGCAGATC AGATGTTCCC TTCAG AATCA AAACAAAAGA AGGTTGAAGA AAATTCTTGO 2700 
GATTCTG AGA GTCTCCGTGA GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTACA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 
ATCTTGGATA CAGTTCATTC TTGTGAAAGA GCAAGGGAAC TTCAAAAAGA TCACTGTGAA 2880 
CAACGTACAG GAAAAATGGA ACAAATGAAA AAGAAGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAGAAATAAA ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTGCAGTG TGAGATTGAC TTTAAACCAA GAAGAAGAGA AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 
GAAGTGAAAC AACAA CTTGA ACAGGCTCTC AGAATACAAG ATATAGAATT GAAGAGTGTA 3180 
GAAAGTAATT TGAATCAGGT TTCTCACACT CATG AAAATG AAAATTATCT CTTACATGAA 3240 
AATTGCATGT TGAAAAAGGA AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATTTTAAA AGAAAAGAAT 3360 
GCTGAACTTC AGATGACCCT AAAACTGAAA GAGGAATCAT TAACTAAAAG GGCATCTCAA 3420 
TATAGTGGGC AGCTTAAAGT TCTGATAGCT GAGAACACAA TGCTCACTTC TAAATTG AAG 3480 
GAAAAACAAG ACAAAGAAAT ACTAG AGGCA GAAATTG AAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AAAGTCAAGA ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATG AATG TTGATGTGAG TAGTAGGATA 3660 
TATAACAATG AGGTGCTCCA TCAACGACTT TCTGAAGCTC AAAGGAAATC CAAAAGOCTA 3720 
AAAATTAATC TCAATTATGC AGGAGATGCT CTAAGAGAAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAGAG ACCAACGTGA AACACAGTGT CAAATGAAGG AAGCTG AACA CATGTATCAA 3840 
AACGAACAAG ATAATGTG AA CAAACACACT GAACAGCAGG AGTCTCTAGA TCAGAAATTA 3900 
TTTCAACTAC AAAGCAAAAA TATGTGGCTT CAACAGCAAT TAGTTCATGC AC AT AAG AAA 3960 
GCTGACAACA AAAGCAAGAT AACAATTG AT ATTCATTTTC TTG AGAGGAA AATGCAACAT 4020 
CATCTCCTAA AAGAGAAAAA TG AGG AGATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT CATG AG AG AC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAGACCAGA TCTTTACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTAGCAT CACCTTATGT TGAA AATCTT ACCAATAGTC TGTGTCAACA GAATACTTAT 4260 
TTTAGAAGAA AAATTCATGA TTTCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTGAA 4320 
GAATTACTTG TTCACGAATT GCATAAAGCT GCACAGGATT CCCATCTACC CTGATGATGC 4380 
AGCAG ACATC ATTCAATCCA ACCAGAATCT CGCTCTGCAC TCC AGCCT AG GTGACAGAGT 4440 
GAGACTCCAC CTCGGAAA 



Scq ID NO: 4 Protein sequence: 
Protein Accession*: NP 443723.I 

1 11 21 31 41 51 
I I I 1 I I 

MTKRKKTINL NIQDAQKRTA LHWACVNGHE EVVTFLVDRK CQLDVLDGEH RTPLMKALQC 60 
HQEACANILI DSGADINLVD VYGNMALHYA VYSEILSWA KLLSHGAVIE VHNKASLTPL 120 
LLSITKRSEQ IVEFLLIKNA NANAVNKYKC TALMLAVCHG SSEIVGMLLQ QNVDVFAADI 1 80 
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CGVTAEHYAV TCGFHHIHEQ IMEYIRKLSK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KGRPRKIAWE KKEDTPREIM SPAKETSEKf TWAAKGRPRK 360 
TAWEKKETPV KTGCVARVTS NKTKVLEKGR SKMIACPTKE SSTKAS ANDQ RFPSESKQEE 420 
5 DEEYSCDSRS LFESSAKIQV OPESIYQJCV MEINREVEEP PKKPSAFKPA lEMQNSVE^K 480 

AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQ^IDK^G 540 
KLEESPNKDG LLKATCGMKV S1PTKALELK DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKNEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKJE1DKJNGK 660 
LEGSPVKDGL LKANCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQKSVPNKAL 720 

10 ELKNEQTLRA DEILPSESKQ KDYEESSWDS ESLCETVSQK DVCLPKATHQ KEIDK^GKL 780 
EESPDNDGFL KAPCRMKVS1 PTKALELMDM QTFKAEPPEK PSAFEPAIEM QKSVPN^\LE 840 
LKNEQTLRAD QMFPSESKQK KVEENSWDSE SLRETVSQKD VCVPKATHQK EMD^SGKLE 900 
DSTSLSKJLD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA K^SQ^Q ' *» 
KVKWEQELCS VRLTLNQEEE KRRNADILNE KIREELGRffi EQHRKELEVK QQLEQALWQ 1020 

15 DIHXSVESN LNQVSHTHEN ENYLLHENCM LKKEIAMLKL EIATUCHQYQ ^^J^^H^ ^ 
KILKEKNAEL QMTLKLKEES LTKRASQYSG QLKVLIAENT MLTSKLKEKQ DKEILEAEIE 1 140 

sto^r^asav qdS>ojvtsr ksqepafhia gdaclqrkmn vdvssttynn ^JH^^?!^^ 0 ^, >. n 

0RK5K5LKIN LNYAGD ALRE NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ 1260 

1320 

20 NNHLKNRJYQ YEKEKAETEN S 



25 



30 



Scq ID NO: 5 DNA sequence 

Nucleic Acid Accession*: none found 

Coding sequence: 273-1785 (undcriined sequences correspond to start and stop codons) 



U 21 31 41 

1 I I t 1 



CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA COGCGGCGCT 60 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC ; TJTOTAGAT 120 
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 
C^CA^TGGCTCTGAGCTATGACAAGAGAG^ 240 
TGCCATAAGT G AGA AGCAAA CTTCCTTGAT AACATGCTTT TGCG AAGTGC AGG AAAATTA 300 
SSSJ^S^A^AGOATCCTOAG AGTACAGCCC CCACCCCttGTCC MAGGTC 360 
35 TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT ™CAGGACA 420 
G/^GGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT SSJS^^IT^ 4 *° 
ntTTTGCCTAG GACTAGAAGG CTCAG ATTTT CAGTGTCGGG AC ACTCCC AT TCCTC^TCAA 540 ^ 
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAG ACC 1 vf> 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT 
40 ATATPTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTG AGCAGT CTCAGAGCTC AGGAAGTGGA 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC ^AGATTCAGAT G^AAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
45 OT^AAAGTGTTCTTCACCAC AGAGGAAGCC ^^^^^^S^^^^f^f^^^^^^ £? 
ACAGTGTTGA TG AGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CA^GACA 080 

so ssssmas«SSS 

aatcacttcc agtcttacat catggctgac a ^^ a ^ a 9^T^^^^3^^I^I^Sq9 c 

GTTGCTAGGA GATGTGTATC AGGAGGTATAGTGGAAGAAT^GCTTCC ™TCATGAC 560 
55 CTAGTGCCCAGTGACCCCTCTTATGAGGAC ATCAGOiAGATTG^G^T^GAAOTTA 1620 
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATOAQTOTC TAAGGCAGAT GGGAAAACTC 680 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA ^^9^°™°^ >*» 
ACACTTGCCA AAATGTC AGA GTCCCAGGAC ATTAAACTCTGATAGGAGAG GAAMGTA^j 1800 
riTrrrw* fiAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TCTGTTTGTA GGCGG AG AAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 



65 m NO: 6 Protein sequence: 

Protein Accession #: none found 



70 



DSG^WTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 
GPUffi^EU SVTVCSLLLVLIILFCYFRY KRQETRPRYS IGLEQDETYI ^^^^^Ll 180 m 
mS^SGSGS GLPLXVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KWFTTEEAS 240 
7S ^RETETYOT VLMRHENILG FIAADKGTG SWTQLYUTD YHENGSLYDY LKSTTLDAKS 300 
^^LA YS^S^G^l^-HTEff STOGKPAIAH RDUCSKNILV KKNGTCCIAD LGLAVKFISD 360 
K! YSFGULWEV ARRCVSGGIV 420 
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EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence : 482-30O7(underlined sequences correspond to start and stop codons) 



1 U 21 31 41 51 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 
TACAACCTAA G AAAGCAAG A GTG ATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 1 20 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCA AAAA AGTCAGCATA GCTGTG AAGT 1 80 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CT AOTAT GCA 240 
CTCTGCTTGA AATATTTTCA ATGA AATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 
CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 
CCAAACTCCA GTGCTCTCAT CCAG AGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
AGATGGCACA ATGCTAATAA ATTGTG AAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 
CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGG AT TTAACAATAT 780 
TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 
TCACAATTCT TTAGAAATTC TTAAAGAGG A TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 
CCTGCAAGCA GATAACAATT TTATCACAGT GATTG AACCA AGTGCCTTTA GCAAGCTCAA 960 
CAG ACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 
TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1 140 
GGCCTGCAAT TGTG ACTTAT TGCAGTTAAA AACTTGGTTG G AG AACATGC CTCCACAGTC 1 200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 
CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 
ATCCACTCA A CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1 500 
ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 
TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 
GTCTGATCTA GTGG AATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1 680 
AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 
TAACCACCTG ACC AAATTAA GTAA AGGCAT GTTCXnTGGT CTCCATAATC TTGAATACTT 1800 
ATATCTTGAA TACAATGCCA TTAAGG AAAT ACTGCCAGGA ACCTTTAATC CA ATGCCT AA 1 860 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA AC(XCTGGGA 2040 
CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC C AGGTTTAGT AAATAACCCA TCCATGCCAA CACA GACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 
TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTG ATTA TGTTCATCAC 2340 
TATTGTTTTC TGTGCTGCAG GG ATAGTGGT TCTTGTTCTT CACCGCAGG A GAAGATACAA 2400 
AAAGAAACAA GTAGATGAGC AAATGAG AGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 
GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAG AAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGG AA GAAGAGAGGA ATGAGAAAGA AGGAAGTG AT GCAAAACATC TCCAAAG AAG 2640 
TCTTTTGGAA CAGG AAAATC ATTCACCACT CACAGGGTCA AATATG AAAT ACAAA ACCAC 2700 
GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAG AA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 
GGAAACATTA ATGTACTCAC GTCCAAGG AA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 
TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 
AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3 120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 



Seq ID NO: 8 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

MKLWIHLFYS SLLACISLHS QTP VLSSRGS CDSLCNCEEK DGTMUNCEA KGIKMVSEIS 60 
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 120 
HNSLEILKED TFHGLENLEF LQADNNF1TV IEPSAFSKLN RLK.VULNDN AIESLPPNIF 180 
RFVPLTHLDL RGNQLQTLPY VGFLEH1GRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 
IIGDWCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300 
TSILKLPTKA PGUPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIESLSDLRP 360 
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PPQNPRKLIL AGN11HSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420 
NHLTKLSKGM FLGLHNLEYL YLEYNAlKEi LPGTFNPMPK LK.VLYLNNNL LQVLPPHIFS 480 
GVPLTKVNLK TNQFTHLP VS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 
TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 
LTDAVPLSVL ILGLUMFIT IVFCAAGIVV LVLHRRRRYK. KKQVDEQMRD NSP VHLQYSM 660 
YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 
LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EK.ERELQQLG ITEYLRKNIA 780 
QLQPDMEAHY PGAHEELK.LM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 
T 

Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #:NMO03474 

Coding sequence: 307-3036 (underlined sequences correspond to start and stop codons) 



1 U 2l 31 41 51 
1 1 ] 



CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180 
CCGGAGCTGA CTCGCCG AGG CAGGA AATCC CTCCGGTCGC G ACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 
GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTG A TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACTACC ATGG AC ATGT ACGGGG ATAT TCTG ATTCAG CAGTCAGTCT CAGCACGTGT 720 
TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT COGGGGATCA 840 
TGTGGA^C ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 
CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT AC^GACCACT GAACA^OSG 1080 
ATCGTGTTGG TAGGOGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTC/^GAC U40 
CCATTCACCA GCCTCCATG A ATTTCTGG AC TGGAGGAAG A TGAAGCTTCT ACCTCGCAAA 200 
TCCCATGACA ATGCGCAGCT TO iCaGTGGG GTTTATTTCC AAGGGACCAC CATCGGC^G 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT G^CCATTCA 1320 
GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGAXQ WO 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGG AGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CA^ 00 ^ 1 ^ 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT 

AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 
GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG C ACAGGGGCC 1800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 860 
GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTCG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 7320 
GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTCTCTT TGGGGTOAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGG AAG A A CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAC I CACAG ^ACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG G AATTCTGGT G ACCATOCTC 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTC 2520 
TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 
CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAG AT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA G ACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC ^2820 
AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG G ACCCCAGGA 2940 
CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAGAT CCACCCACAC CGCCTATATT AAGJGAGAAG CCGACA ^ TI1 Z C . A ^ C ^ G i ?° 6 ° 
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 
ATTrmr TAATGTnAA AACATCATTA CTATAAG AAC TTTG AGCTAC TGCCGTCAGT 3180 
CCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 
GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTG AGTTTT 3300 
rcATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GG ACAGG ATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 
CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
rTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCAGGG AG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
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ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAG AC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTG AG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCA AGGC 3780 
GGTGCTTG AT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 
5 CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AG AAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACT GTAATA 3960 
CAATG ATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

1 0 TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAG AACC TT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

1 5 AATGGCATGA G AAA T AC AAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA G AAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATG AGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGA AAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAG A AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

20 AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AG AATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCA AACAT A 4920 
TATATACTAT TAAAAAGGTT TACAG AATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 

25 ACCAAAAAAA AAAAAAAAAA AA 



30 



Seq ID NO: 10 Protein sequence: 
Protein Accession #: NPJW3465.2 



t It 21 31 41 51 

MAARPLPVSP ARALLLALAg' ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWIP VKSFD 60 
SKNHPEVLNI RLQRESKELI INLERNEGU ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 

35 YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLK.SVRGSCG 180 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 
LIEIANHVDK. FYRPLNIR1V LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP 1MSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

40 SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAOT 480 
ACRDSSNSCD LPEFCTQASP HCPANVYLHD GHSCQDVDGY CYNGlCQTHE QQC VTLWGPG 540 
AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 
IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKJCLNRQCQ NISVFGVHEC 660 
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 

45 LAAGF WYLK RKTURLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 

50 Scq ID NO; 1 1 DNA sequence 

Nucleic Acid Accession »: NM J>0 1 394 

Coding sequence: 400-1584(underiined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

55 | I I | I I 

GGCACGAGGG CTGAGCGCCG GAGGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAGAGT CCGCGGTCCT 120 
CTCTCGTAAA CACACTCTCC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 180 
TGGGCGCCCG AGGCCGCTCC GACTGCTATG TGACCGCGAO GCTGCGGGAG GAAGGGGACA 240 

60 GGGAAG AAGA GGCTCTCCCG CGGGAGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCTTC 360 
TCTTCCTCGC GCGCCCAGCC GCCTCGGTTC CCGGCGACCA_TGGTGACGAT GGAGGAGCTG 420 
CGGGAGATGG ACTGCAGTGT GCTCAAAAGG CTGATGAACC GGG ACGAGAA TGGCGGCGGC 480 
GCGGCCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 

65 CTGGACTGCA GACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600 
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGGA GCAG ATCCTG 660 
CCCGCCGAGG AGGAGGTACG CGCCCGCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720 
TACGACG AGC GCAGCCCGCG CGCCGAGAGC CTCCQCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA CGCCGAGCGC ACCG ACATCT GCCTGCTCAA AGGCGGCTAT 840 

70 GAG AGGTTTT CCTCCGAGTA CCCAGAATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900 
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTGTGGG 960 
ACCCCACTAC ACGACCAGGG GGGTCCTGTG GAGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTGAAT 1080 
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GGACACTATC AGTACAAGTG CATCCCAGTG 1 140 

75 GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCGTGAAGG ACTGCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGCGGTCG 1260 
GCCACCATCT GCCTGGCCTA CCTGATGATG AAGAAACGGG TGAGGCTGGA GG AGGCCTTC 1320 
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GAGTTCGTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TCAGCTTCAT GGGGCAGCTG 1380 
CTGCAGTTCG AGTCCCAGGT GCTGGCCACG TCCTGTGCTG CGGAGGCTGC T^C^CTCG 1440 
GGACCCCTGC GGGAGCGGGG CAAG ACCCCC GCCACCCCCA CCTCGCAGTT OTCTTCAOC 1500 
TTTCCGGTCT CCGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCCC 1560 
ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1620 
CAGCAAGGGT AGGACGGGCC GCATGCGGCA G AAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680 
CG ACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACGA CG AGGCCAGC CAGAATGGCA 1740 
ATAAGGACTC CGAATACATA ATAAAAGCAA ACAG AACACT CCAACTTAGA GCAATAACCG 1800 
GTGCCGCAGC AGCCAGGGAA GACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCGAT 1860 
AG AAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTG ATGAAA CCCACAGATC 1920 
CTAGCAAATG TGCCCAACCA GCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATCAG 1980 
AAG ACAAGTT TCCCAGAAGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TOTOTTGT 2040 
AGTTAAAGGA ATTTCATTTT TAAAAGAAAT CTTCGAAGGT GTGGTTTTCA mCTCAGTC 2100 
ACCAACAGAT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT CTTCAGAGTA 21 60 
TGAAAGTACA AAAAGTCTAG TTACAGTGGA TTTAGAATAT ^^^^^Sj^^^t^^^f^^L 
GCTG AGCACC GTAGCATGCA GATGTCAAGG CAGTTAGGAA GTAAATGGTG TCTTGTAGAT 2280 
ATGTGCAAGG TAGCATGATG AGCAACTTGA GTTTGTTGCC ACTGAGAAGC AGGCGGGTTG 2340 
GGTGGGAGGA CGAAGAAAGG GAAGAATTAG GTTTGAATTG CTTTTTAAAA f^^^^f^^J^^ 
AG AAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAAGCAG GCGGGTTGGG 2460 
TGGGAGGAGG AAGAAAGGGA AGAATTAGGT TTGAATTGCT TTTTTAAAAA A AAA 

Seq ID NO: 12 Protein sequence: 
Protein Accession*: NPJKJ1385 

I U 21 31 41 51 

MVTMEELREM DCSVLKRLMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFLAHSAGY 60 
ILGSVNVRCN TIVRRRAKGS VSLEQILPAE EEVRARLRSG LYSAVIVYDE RSPRAESLRE 120 
DSTVSLVVQA LRRNAERTDI CLLKGGYERF SSEYPEFCSK. TKALAAIPPP VPPSATEPLD1 80 
LGCSSCGTPL HDQGGPVE1L PFLYLGSAYH AARRDMLDAL GrTALUWSS DCPNHFEGHY 240 
OYKCIPVEDN HKADISSWFM EA1EYIDAVK DCRGRVLVHC QAGISRSATI CLAYLMMKKR 300 
VRLEEAFEFV KQRRSIISPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 



SeqIDNO:13DNAs . 
Nucleic Acid Accession ft: none found 

Coding sequence: 68-340<undcrlmed sequences correspond to start and stop cooons) 



I 11 21 31 41 51 

AGCGCCTTGC CnCTCTTAG GCTTTGAAGC ATTTTTGTCT ^JS^^^^^J^^^^IL f 2Q 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 
TGCCCAGA7T CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 
TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240 
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TT^CCCAA 300 
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA^G^TCT 'M 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 
CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTT T CAAATAAAAA ATAACTATGA 480 
GCGAGCTAAC AT 

Seq ID NO: 14 Protein sequence: 
protein Accession #: none found 

1 11 21 31 41 51 

MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60 
ATTAASTTAR KDIPVLPICWV GDLPNGRVCP 

Seq ID NO: 1 5 DNA sequence 

Nucleic Acid Accession ft NM 016640.2 

Coding sequence: 39-1 358(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

(3CTTAAGTTG acctctgggt CCGG aatcgc GGGCAAAG AT GGCGGCGGCC AGGTGTTGGA 60 
GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC G ACCTCCCAA GACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCG ATTG 180 
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGG ATCGAG CGCTGGCAGG 240 
CG ACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATGAAGTA CATGGTTTAC CCGCAG ACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360 
ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420 
AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540 
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AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600 
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720 
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 
5 CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840 
• CAGACAAACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900 
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 
GGGAAAGGCT TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG IO20 
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 

1 0 AAGCAG ATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAG ATGGA AAATACTTTT 1 1 40 
CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTG AT CAAAATAACC 1200 
CTCGTAAAA A TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTG AGGATA 1260 
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 

, c GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380 

1 5 TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATOTC AACCTGTTAT TAGATCTCTT 1500 
ACTCTGCTCA AATTCATCAC TGAAAGATTT AATTTTAGTT ACCTTTTOTI GATTTAAAAA 1560 
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTGA GTTATTGAGC TATTAAATGC 1620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1 680 

20 AAAAGAACCC ACTGC 
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40 



Scq ID NO: 16 Protein sequence: 
Protein Accession #: NP _057724. 1 



I 11 21 31 41 51 
I I I I I 1 

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTADSKAARL 60 
RRIERWQATV HAAESVDEKL RILTKMQFMK VMVYPQTFAL NADRWYQYFT KTVFLSGLPP 1 20 

3 0 PPAEPEPEPE PEPEPALDLA ALRA VACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 1 80 

VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNNQ 240 
BUSKQLAEF VPLDYSVPE IPTDCCKPDK. LPLFKRQYEN HffVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQI EWFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360 

_ - ITDGKYFSFF CYQLNTLALT TQADQNNPRK N1CWGTQSKP LYETIEDNDV KGFNDDVLLQ 420 

3 5 IVHFLLNRPK EEKSQLLEN 



SeqIDNO; 17 DNA sequence 

Nucleic Acid Accession U: NM_025059. 1 

Coding sequence: 3-2150 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 
I I I I I I 
, r GCATGAGCCT GG ACTGC ACC AGCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCGAGG 60 
45 AAACTTACGA TCATCTTTCG GAAGTCCCGG TCACGCGGGA GCAGTTAAAC CACTATCGGA 120 
ATGTGGCTCA AAATGCTCGA AGTGAACTTG CAGCAACTTT GGTCAAATTT GAATGTGCTC 180 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTGA AATGGAGAGC TACAAGGAAA ACAATGCCAG AAAATCATCT CTCCTTACCT 300 
CTTTGAGAG A CAGAGTTCAG G AACTAGAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 
5 0 TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420 
TTGTAGAGTT AAATG AAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAA AC 480 
. AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTOCOT GACTGCTTGG 540 
ATCCAGATGA GAGGAATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATG AATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660 

5 5 TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACGATCAT GAGGCTGGCT TCAGAAGTCA 720 

ACAGAGAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAGAA AGAGAAGCTG AACCAGGACC 780 
TGCTCAGTGC TGTAG AAGCA AAAGAAGCTC TTGAAAGGGA AOTTAAGATC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTGAAGA 900 
AAAGCTCTTC TGAGTTGGAG AAGAGTTTGA AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960 
60 AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020 
TGACTGGGTC CACTGAGGAC ACCATTTTGG AGAAGATTCG AGAAATGGAC AGCCGGGAAG 1080 
AAAGCAGGGA CCGGATGGTC TCCCAGCTTG AAGCCCAAAT ATCTGAGCTT GTTGAACAGT 1 140 
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200 
TGTTGG AGAC TCTTCAGGGT CAGCTG ACAC ACCTGGAGGC AG AGCTGGTT TCTGGAGGTG 1 260 

6 5 TTTTGCG AG A CAACTTGAAT TTTG AG AAAC AAAAATATCT TAAATTTCTG G ATCAGCTTT 1 320 

CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTG GTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AG ACCATTGC CCACAATTTG CAGAGAAAGC TAAAGACACA G AAAGAGAG A CTGGAGAGCA 1 500 
AAG AATTACA CATG AGCCTC CTCCGGCAG A AAATAGCCCA GCTGGAGGAG G AG AAGCAGG 1 560 

70 CACGCACGGC CTTGGTGGTT GAGAGGG ACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1 620 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCG AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACT GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCAAGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAGAATAAAG 1860 

75 AAAGGGCCAG AAAC ATG ATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAGACTT CAGGGAGGTG GTGTCGCAGA 1980 
TGCTAGGCTT G AACGTG ACC AGCCTTGCTC TTCCTG ATTA TG AAATCATC AAGTGTCTTG 2040 

354 



# 



WO 02/059377 PCT/US02/02242 



AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100 
CTACTGGGCA AGAGAGGCAC CCACAAGGCC ATTTACAGCT TCTTCATTGA ACACTGTATC 2160 
TCTTGAGAG A GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGAT TTG ATCAGTT TGTGAATATT TTATGCTTTG ATG ATATAGT G AG AATGCAT 2280 
5 CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAG ATAA ACGTCAGCAT TA^A^CGC 2340 
CAAAAAAAAA AAAAAAAAGC ATTTTAGGAT CCAGAAG AAT TCCACCAGAT TGCATG AjOTT 2400 
AG ATTGGG AA ATGGG AGTGG GAGATAATAT TGGGAGGTAT CTATTTTAAG TCAGGGGCTT 2460 
TACTAGCCGA TTTAGTTCTC ACAATAACCA TGTGGAG AAG CTGTGACATT TTTAATTTAC : 2520 
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 

1 0 GACCAGGATG GACAACTCAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640 
AGAATCTGTT TCTCCTG AAT CTCTGTGATG CTGGTGGGAA TTGTTTGCAT AG AGG AAGG A 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTG AG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTGAGACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 

1 5 GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940 
CGGCAGTGAG CCAAGATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000 
CTCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATG ATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ^TUGQATCA 3 120 
AACTTACATA GGTCTCAGGT CCTGTAAG AA ACTTGCCTGT TCTAACTGTT GCTACCAGAT 3180 

20 TATATCTGGT GGTAATTGTT AATGTTTCAG CAGGGCTGOT CTCAGTCOT TAA^TCGAA 3240 
AGCTTTATTT GGAGCCCAAC CCTATAAGAT GAAGAAATCC TATATAGTCT TATTCACCAA 3300 
TATATCCAAT ACACCCACAG CAATGGTACC TTTTTAAGAT CAGG ATTTT A TTATG AATTC 3360 
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG G AATCATATT 3420 
CTTAACTTCA CTGAGAAAAA TGTGTTACTC TTTTGGACAA TTTATCTTAT TTCTATCATA 3480 

25 TAAGATGTAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATrAA AGG AA AG GG A 3540 
AAG AAATAG A TCTGGTGCAC CCG AACATTA GGAGAAAATG AAAAATATAC AACCAACCGT 3600 
TCGTG AGTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC CGTCTCTGCA 3660 
AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC ™CTCGAGAG 3720 
GCTG AG ACAC GAAAATTGCT TGAACCTGGG AGGCGGAGGT TGCAATGAGC CGAGATCGCG 3780 

3 0 CTACTGCACT CCAGCCTGGG CAACAG AG AG AGACCTTGTC TCAAAAAACA ACAACAACAA 3840 
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTCTTTCTTCTAGACY^ 3900 
AG AG ATG AAA CAATCAATTG CAAGGGTCAA AACTAGATTG GATCTTGGTT TG AATG A AAA 39 60 
AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA GTCCTAGATT "4020 
ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTGATTGTTA TACTTTAAAT 4080 

35 AACTAAAAAT CTGAGAATCC ACAGTGCTAC AGACAATAAA TG ATAAAATG GG AAAAAAAA 4140 
AAAAAAAAAA A 

Seq ID NO: 18 Protein sequence: 
40 Protein Accession #: NP_079335.I 

1 11 21 31 41 51 

MSLDCTSHIA LGAASPAPEE TYDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 
45 SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAALSTSKI 120 
RTEITAHAAI KJENQELKKKV VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDCLD 180 
PDERNDKASD EDULKLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE TIMRLASEVN 240 
REQKKAASCT EEKEKLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDTILE KIREMDSREE 3&) 
50 SRDRMVSQLE AQISELVEQLGKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420 
LRD^WEKQ KYUCFLDQLS QKMKLDQMAA ELGFDTRLDV VLARTEQLVR LESNAVIENK 480 
TIAHNLQRKL KTQKERLESK ELHMSLLRQK XAQLEEEKQA RTALWERON AHLTnWLQK 540 
KVERLQKELN TCRDLHTELK AKLADTNELK IKTLEQTKAI EDLNKSRDQL EKMKEKAEKK 600 
LMSVKSELDT TEHEAKENKE RARNMIEVVT SEMKTLKKSL EEAEKREKQL ADFREWSQM 660 
5 5 LGLNVTSLAL PDYE11KCLE RLVHSHQHHF VTCACLKDVT TGQERHPQGH LQLLH 

Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession* AF071552, NM 000662 

60 Coding sequence: 441-13 13 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 

65 CTTTGTATAA GGCTCAGCTA AAAGGG AAAT TGAGTGGGTC AGGTACCACG^TACTATAC 60 
ACTCTATTGC ATGATTCTCC TGCCTACATC AGAAG ACGTT TATAAGCCTA TJTTAAAGGA 120 
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAGAGAAC CATGAACAAG CTGTTTATCA 180 
TTTGACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTGAAAG AAG ACATAATACA 240 
TTTCTGACAG G^TTCTGG^ACTATTAACTG AACTTATGTG TGTAAAAGGA^CATACAA 300 

70 TfiAAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTAAAATA 360 
TAGC^A^T^C^ 420 
TTTTCCTTGC TTAGGGGATC ATGGACATTG AAGCATATCT TGAAAGAATT G G CT AT AAG A 480 
AGTCTAGGAA CAAATrGGACTTGGAAACATTAACTGACAT TCTTCAACAC CAGATCCGAG 540 

Scc^tgIgaYcot « 

7 5 CCATTTTTG A TCAAGTTGTG AGAAG AAATC GGGGTGG ATG GTGTCTCCAG GTCAATCATC 660 
TTCTGTACTG GGCTCTGACC ACTATTGGTT TTGAGACCAC GATGTTGGGA GGGTATGTTT 720 
A^CACTCC TgCCAAAAAA TACAGCACTG GCATGATTCA CCTTCTCCTG CAGGTGACCA 780 

355 
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TTGATGGCAG GAACTACATT GTCGATCCTO GGTTTGGACG CTCATACCAG ATGTGGCAGC 840 
CTCTGGAGTT AATTTCTGGG AAGGATCAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900 
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATGAAG 960 
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020 
5 TTAAGCCTCG AACAATTGAA G ATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1080 
CATCTGTGTT TACTAGTAAA TCATTTTGTT CCTTGCAGAC CCCAGATGGG GTTCACTGTT 1 140 
TGGTGGGCTT CACCCTCACC CATAGGAGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CCTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAGATT TTTTACTATT JAGAATAAGG 1320 
1 0 AGTAAAACAA TCTTGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTGA CGACCTATCA 1 380 
TGTATCTTCT GTACCCTTAC CTTATTTTGA AGAAAATCCT AGACATCAAA TCATTTCACC 1440 
TATAAAAATG TCATCATATA TAATTAAACA GCTTTTTAAA GAAACATAAC GACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 
TGGAAATTGG TGATTTATGC TAG AAAGCTT TTAATGTTGG TTTATTGTTG AATTC 



Seq ID NO: 20 Protein sequence: 
Protein Accession U: NP_000653.1 



20 1 11 21 31 41 51 
J I I I I I 

MDIEAYLERI GYKKSRNKLD LETLTDILQH QIRAVPFENL NIHCGDAMDL GLEAIFDQVV 60 
RRNRGGWCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVT1DGRNY1 1 20 
VDAGFGRSYQ MWQPLELISG KDQPQVPCVF RLTEENGFWY LDQIRREQY1 PNEEFLHSDL 1 80 
25 LEDSKYRKTY SFTLKPRTIE DFESMNTYLQ TSPSSVFTSK SFCSLQTPDG VHCLVGFTLT 240 
HRRFNYKDNT DLIEFKTLSE EEIEKVLKNI FNISLQRKLV PKHGDRfFTI 

Seq 10 NO: 21 DNA sequence 
Nucleic Acid Accession #: NM_003714 
3 0 Coding sequence: 1 23-t03 1 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 

I I I I I I 

3 5 CGGCACGAGC AAAAAGG AAG AGTGGGAGGA GGAGGGGAAG CGGCGAAGGA GGAAGAGGAG 60 
GAGGAGGAAG AGGGGAGCAC A AAGG ATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAG AA 1 20 
CCATGTGTGC CG AGCGGCTG GGCCAGTTCA TGACCCTGGC TTTGGTGTTG GCCACCTTTG 1 80 
ACCCGGCGCG GGGGACCGAC GCCACCA ACC CACCCGAGGG TCCCCAAGAC AGGAGCTCCC 240 
AGCAGAAAGG CCGCCTGTCC CTGCAGAATA CAGCGGAGAT CCAGCACTGT TTGGTCAACG 300 

40 CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAG ATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCATTCAT CAAAGACGCC TTG AAATGTA AGGCCCACGC TCTGCGGCAC AGGTTCGGCT 480 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGG AATGCT 540 
ACCTCAAGCA CGACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGGAGATGA 600 

45 TCCATTTCAA GGACTTGCTG CTGCACGAAC CCTACGTGG A CCTCGTGAAC TTGCTGCTGA 660 
CCTGTGGGGA GGAGGTG AAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAG A 720 
ACTGGGGAAG CCTGTGCTCC ATCTTG AGCT TCTGCACCTC GGCCATCCAG AAGCCTCCCA 780 
CGGCGCCCCC CGAGCGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCGA GGTGCCAAGG 900 

5 0 GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCCG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TA TTCT GATA 1020 
TCCGG AGGTGAAATG A AAG G CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080 
ATCTATGGAC ATTCCAAAAC ATTTACCATT AGAG AGGGGG GATGTCACAC GCAGGATTCT 1 140 
GTGGGGACTG TGGACTTCAT CGAGGTGTGT GTTCGCGGAA CGGACAGGTG AGATGG AGAC 1 200 

55 CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGG 1260 
GAGCGCGCCC GCGTTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGGAG TCAGTGGGTG 1320 
TCGGCCGCTC TGTTGTGGGG G AGGTGAACC AGGOAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT CCGCGGAAGT CAGGGCGGCT GG ATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 

60 AAATATCGCT TAGAATGCAG GAGAAGGGTG GAGAGGAGGC AGGGGCCGAG GGGGTGCTTG 1560 
GTGCCAAACT G AAATTCAGT TTCTTGTGTG GGGCCTTGCG GTTCAG AGCT CTTGGCG AGG 1 620 
GTGGAGGG AG GAGTGTCATT TCTATGTGTA ATTTCTG AGC CATTGTACTG TCTGGGCTGG 1 680 
GGGGGACACT GTCCAAGGG A GTGGCCCCTA TGAGTTTATA TTTTAACCAC TGCTTCAAAT 1 740 
CTCGATTTCA CTTTI 1 1 1 AT TTATCCAGTT ATATCTACAT ATCTGTCATC TAAATAAATG 1800 

65 GCTTTCAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accession 8: NP 003705 

70 

1 11 21 31 41 51 
I I I I I I 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFDCDAUCCK AHALRHRFGC 120 
75 ISRKCPAIRE MVSQLQRECY UCHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180 
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 
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RR 

Scq ID NO: 23 DNA sequence 
5 Nucleic Acid Accession*: NMJ>05264.1 

Coding sequence: 557-I954(underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTC T GCTCGGAA AT CGTCCTGGCC 120 
CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 180 
TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGG A CACCATTGCC 240 

1 5 CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 
TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGG AGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGG ATTT TTTTGGGGGG GCGGGGACCA 540 

20 GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 
GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTCATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATG ACT GCCGCAGCGC 780 
CATGGAGGCC CTGAAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 

25 GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 
GCTGG AGG AT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAOTACAGGT CGGCGTACAT 1080 
CACCCCGTGC ACCACCAGCG TGTCCAACGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1 140 

30 CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAG AG AGG G AG AAGCCCA ACTGTTTG AA TTTGCAGG AC TCCTGCAAG A CG AATTACAT 1320 
CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTCQCAC 1440 

35 AGTCATGACC CCCAACTACA TAGACTCCAG TAGCXITC AGT<yT|G^CCCCAT ^^T^TI^^^^I l f* 
CAGCAACAGT GGG AACGACC TAGAAGAGTG CTTGAAATTT TTG AATTTCT TCAAGGACAA 1 560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 
GCCAGCC1TC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC G GGTT.AAG AA 680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 740 

40 TGCAAATTTA CAGGCACAGA AGCTGAAATC ^^^^^^ ^^^^^f^^^^^l^J, \f° 
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCAGAAAATC 1860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACXGCTCT 1 920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 1980 
ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTT GTATAGC TGAAATTCCA 2040 

45 GmAG^CTCAGTTGAGAAACAGTTCCATTCAACTG 

AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 21 60 
AAACTC^GA^ G^CTTTGG^GA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA ?IT^,2H??'?J ^ 
A^CCAAACTGGGGCTGTGTrrTCGATGATGATGATC^^^ 

AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 
50 TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TO ATGCCAAG 2400 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT ^ G ^^^J^ r % 2 ^° 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATQTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 

55 a»q ID NO: 24 Protein sequence: 

Protein Accession #: NPJD05255 

1 11 21 31 41 51 

60 MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60 

SLASGLEAKD ECRSAMEALK. QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL QGNDLLEDSP 120 
YEPVNSRLSD IFR WPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKKY RSAYITPCTT 180 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS OUGTVUTYU 300 

65 YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD ^QPAFPV 360 
QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA Q1CLKSNVSGN THLCISNGNY 420 
EKEGLGASSH 1TTKSMAAPP SCGLSPLLVL WTALSTLLS LTETS 

Seq ID NO: 25 DNA sequence 
70 Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: I576(cntire sequence represents open reading frame) 

1 11 21 31 41 51 

7 5 rrrTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGG AATA ATGGAAAG AA ATGGGGGCTT 60 
75 wSS^^^^gSjjqm TAAGGACCGA GGGCTGCTGG AAGACGAGCG 120 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180 
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CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTG AGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 
GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840 
CGTGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CGAGTACAAC AATGA AAACG ACTTCCTGGC GGGGAGCCCC G ACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020 
GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1140 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320 
GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CGCAAGCCAT 1560 
CCG AATATTC TCCTAA 

Seq ID NO: 26 Protein sequence: 

Protein Accession #: FGENESH predicted 



1 II 21 31 41 51 

FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCK1K ALRAKTNTY1 KTPVRGEEPV 1 80 
FMVTGRREDV ATARREIISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRWG 240 
LWGPKGATI KRIQQQTNTY 1ITPSRDRDP VFEITG APGN VERAREEIET HIAVRTGK1L 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VDSGFEAPRL 360 
GEQGGDFGYG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 480 
VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI RIFS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession U: FGENESH predicted 

Coding sequence: 1 -2070 (underlined sequences correspond to start and stop codons) 

1 11 2! 31 41 51 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TG AGGCCGCA GAGCCCGCAG 1 20 
CTCAGGCAG A GCG ACCCGCA G AAACGGAAC CTGG ACCTGG AGAAAAGCCT GCAGTTCCTG 1 80 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCOC TGCCTCCCCA GGCACACTCA 300 
ACAGTGCGGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAG C ACACG CCTGGGCTCA 360 
GGGGGAACAC AGG ACGGGG A GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 
CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGG ACCT GGACAGATGC CGCTACCTCT 480 
AGCCGTGGCT GG ACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAG AG CTGAAATGGG AAGG AACCCC TGGG ACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 
ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 
GACATGG AGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 
GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTOCCAAG 1200 
CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 
AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAG ACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTG AC GTCTCCCAGA AGGCGG ACCT GGAAGAGGAG 1 560 
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 
GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GG AACAGCCA GCACCAGGGC 1680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCG A AGTGCTCATC CGCG AGCTGT GG AATACCAA CCTCCTGCAG 1 800 
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ACCCAAGAGC TGCGGCACCT CA AGTCCCTC CTGGAAGGG A GCCAGAGGCC CCAGGCAGCC 1 860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 

Scq ID NO: 28 Protein sequence: 

Protein Accession U: FGENESH predicted 



1 II 21 31 41 51 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 
GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 1 80 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTK.GGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 
GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPG VQGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMG AGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM0123 19.2 

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 
GCGGAG ACGA AGGCGCAATG GCGAGGAAGT TATCTGTA AT CTTG ATCCTG ACCTTTGCCC 1 80 
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 
AAATTAGTCC G AATTGGG AA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTITC TACCGCTATG GAGAAAA i AA TTC I i iGiCAGi i GAAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 
ACCACGACCA TCACTCAGAC CACG AGCATC ACTCAG ACCA TGAGCGTCAC TCAGACCATG 480 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCGCAGA CCATGACTCA G ATAGTTCAG 600 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 
TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTG AGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 
TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC AT TTCCAT CA 1 140 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGT TTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 
CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 
AGAAGAAACC TGAAAATGAT GATG ATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 
GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 
GGTGC AAGAA TAAATGCCAT TCACATTTCC ACG ATACACT CGGCCAGTCA G ACG ATCTCA 1 800 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 
CTCACAGTCA CAGCCAGCGC T ACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 
GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 
GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT- 2460 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 
TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 
TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 
TAAACAAGAG ATTTGGCATG AC ATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 
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TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTG AG AATT GG G G AG GCAT AGATTCTTAT 2880 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 
AAATATATTT AATG AATTCA AGCAATATAC ACTTG ACCAA GAAATTGG AA TTTCAAAATG 3 120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 31 80 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
CAAAATTATC AG AGTAGTAA AACTTTGATA TATATG AGGA TATTAAAACT ACACTAAGTA 3300 
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

Seq ID NO: 30 Protein sequence : 
Protein A ccession #: NP_03 645 1 .2 

1 II 21 31 41 51 

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEK1SPNW ESGINVDLAl STRQYHLQQL 60 
FYRYGENNSL SVEGFRKLLQ NIGIDKDCRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180 
SVSASEVTST VYNTVSEGTH FLET1ETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 
TWKGLTALGG LYFMFLVEHV LTUKQFKDK KKKNQKKPEN DDDVEUCKQL SKYESQLSTN 480 
EEKVDTDDRT EG YLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 
MGDGLHNFSD G LAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLUCA GMTVKQAVLY 660 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML USIFEHKIV FRINF 

Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession #: NM_002 1 84. 1 

Coding sequence: 256-301 2(underiined sequences correspond to start and stop codons) 

I II 21 31 41 51 

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 
AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAA(XC AATGGAAAAA GCATGACATT 180 
TAGAAGTAGA AG ACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTG ATTTGG 240 
AAATATCCGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 
CTCACCACTG AATCTACAGG TG AACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 
GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 
AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AG ATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 
TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 
GTGAACGAGG GG AAGAAAAT G AGGTGTGAG TGGGATGGTG GAAGGG AAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 
CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATrTTGAT 900 
CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAG AGG AA 960 
CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 
GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1 140 
GTGTTTAGG A TTCGCTGTAT GAAGGAAGAT GGTAAGGG AT ACTGGA GTGA CTGGAGTG AA 1200 
GAAGCAAGTG GGATCACCTA TGAAG ATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 
ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 
CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 
CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 
TATCTAGCAA CCCTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 
ATCCCTGCCT GTG ACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1 560 
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 
ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 
GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 
CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 
ATATTTTATA GAACCATCAT TGG AAATGAA ACTGCTGTGA ATGTGG ATTC TTCCCACACA 1980 
GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTG TACA TGGTACGAAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 
GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 
GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 
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TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCG AGC 2520 
ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAG AC ACCAAGTTCC GTCAGTCCAA 2580 
GTCTTCTCAA GATCCG AGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640 
CTACAATTAG TAG ATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAGAACT GCAGTCAGCA TG AATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATGA GGAAG ATTTT GTTAG ACTTA AACAGCAGAT TTCAGATCAT 2820 
ATTTCACAAT CCTGTGG ATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GCTTTTGGTC CAGGTACTGA GGGACAAGTA G AAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAGTGAAGG ACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTA AAATGAT TTTATCTGTG AATTC 

Seq ID NO: 32 Protein sequence: 
Protein Accession #: NP 002 J 75. I 

1 11 21 31 41 51 
I } I I 1 I 

MLTLQTWVVQ ALF1FLTTES TGELLDPCGY ISPESPWQL HSNFTAVCVL KEKCMDYFHV 60 
NANYIVWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGIT1 120 
ISGLPPEKPK NLSCIVNEGK KMRCE WDGG R ETHLETNFTL KSEWATHKFA DCKAKRDTPT 1 80 
SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 240 
KLTWTNPSIK SVIILKYNIQ YRTKDASTWS QEPPEDTAST RSSFTVQDLK PFTEYVFRlR 300 
CMKEDGKGYW SDWSEEASGI TYEDRPSKAP SFWYK1DPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480 
YLRGNLAESK CYLITVTPVY ADGPGSPESI KA YLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 
QLPVDVQNGF 1RNYTIFYRT HGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IVVPVCLAFL LTTLLGVLFC FNKRDUKKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSWEIEAND KKPFPEDLKS LDLFKKEKJN 720 
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPDrSH FERSKQVSSV 840 
NEEDFVRIKQ QISDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Seq ID NO: 33 DNA sequence 

Nucleic Add Accession tf: NM_01 8255. 1 

Coding sequence: 1 1 -249 1 (underlined sequences correspond to start aud stop codons) 

I II 21 31 41 51 

i i I I I I 

AGTTGGCGAC ATGGTGGCAC CCGTGCTGGA GACTTCTCAC GTGTTTTGCT GCCCAAACCG 60 
GGTGCGGGGA GTCCTGAACT GGAGCTCTGG GCCCAGAGGA CTTCTGGCCT TTGGCACGTC 120 
CTGCTCCGTG GTGCTCTATG ACCCCCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA 1 80 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTA AACAG GATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCT GGAGGATCTG ATAATCAAGT GATTCACTGG GAAATAGAGG ATAATCAGCT 300 
TTTAAAAGCA GTGCATCTTC AAGGCCATG A AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGG ACATCAGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGCAGATTC 420 
TGCTGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATGATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAGAAAGTG CTTTCTCTCT GTGGACATGA GGATTGGATT AGAGGAGTGG AATGGGCAGC 660 
CTTTGGTAGA GATCTTTTCC TAGCAAGCTG TTCACAAGAT TGCCTGATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAGAAAC TCAGG ATGAC GATAACATAA GACTGAAAG A 780 
AAATACTTTT ACCATAGAAA ATG AAAGTGT TAAAATAGCA TTTGCTGTTA CTCTGGAGAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AGATGGTGTC CTACAGCAGC CAGTGAGATT ATTATCTGCT TCCATGGATA AAACCATG AT 960 
TCTCTGGGCT CCAGATGAAG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGGAAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT GAAGATGGCT CCATOATCAT 1080 
TGCTCATGCT TTCCACGG AG CGTTGCACCT TTGGAAACAG AATACAGTTA ACCCAAGAGA 1 140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA G AATTTATTA TCACTGTTGG TACTGATCAG ACAACTAGAC TTTTTGCTCC 1260 
ATGGAAGAGA AAAGACCAAT CACAGGTGAC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCAATGAT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTGAATC ATGTGCTCTG TAATCAAGAT AGTGATCTTC CAGAAGGAGC 1500 
CACTGTCCCT GCATTGGGAT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1560 
GCCTTCTGAT GAAGAGGAGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGGATCAT CTTCTGCAGA ATACTTTGTG 1680 
GCCTGAAGTT CAAAAACTAT ATGGGCACGG TTATGAAATA TTTTGTGTTA CTTGTAACAG 1740 
TTCAAAGACT CTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGGAAC ACTACATCTT GGAAACAGGT GCAGAATTTA GTTTTCCACA GTTTGACAGT 1860 
CACGCAGATG GCCTTCTCAC CTAATGAGAA GTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGG TCATTG TGGA AAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 1980 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040 
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GAGTCCTG AC AGCAAGTATT TCTTCACTGG G AGTCG AG AC AAAAAGGTGG TTGTCTGGGG 2 1 00 
TGAGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT CCTCAGTCCT 2160 
GGACGTGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GCAGTAGGAT TGGAGTGTGG AAAGATTTGC TTATATACCT GG AAAAAGAC 2280 
TGATCAAGTT CCAGAAATAA ATGACTGG AC CCACTGTGTA GAAACAAGTC AAAGCCA AAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTGAACAGAA 2400 
GGAAGCAGAA GGTGCTGAGT GGTTACACTT TGCAAGCTGT GGTGAAGATC ACACTGTGAA 2460 
GATACACAGA GTCAATAAAT GTGCACTG TA AT GG 

Seq ID NO: 34 Protein sequence: 
Protein Accession #: NP_060725. 1 

I 11 21 31 41 51 
I I I I I I 

MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPLKRVV VTNLNGHTAR 60 
VNCIQWICKQ DGSPSTELVS GGSDNQVIHW EiEDNQLLKA VHLQGHEGPV YAVHAVYQRR 120 
TSDPALCTLI VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPILAC 1 80 
GNDDCR1HIF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQO CURIWKLYI 240 
KSTSLETQDD DNIRLKENTF TIENESVKIA FAVTLETVLA GHENWVNAVH WQPVFYKDGV 300 
LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVGEVGGN TLGFYDCQFN EDGSMJ1AHA 360 
FHGALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFUTVGTDQ TTRLFAPWKR 420 
KDQSQVTWHE IARPQIHG YD LKCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCAITGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKLYGHGYEI FCVTCNSSKT LLASACKAAK KEHAAHLWN 600 
TTSWKQVQNL VFHSLTVTQM AFSPNEKFLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 
TNKJTSVHSR 1IWSCDWSPD SKYFFTGSRD KKVVVWGECD STDDCIEHNI GPCSSVLDVG 720 
GAVTAVSVCP VLHPSQRYVV AVGLECGKIC LYTWKKTDQV PEINDWTHCV ETSQSQSHTL 780 
AIRKLCWKNC SGKTEQKEAE GAEWLHFASC GEDHTVKIHR VNKCAL 

Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: NM J)22 1 3 1 

Coding sequence: 1 1 -2878 (underlined sequences correspond to start and stop codous) 

1 11 21 31 41 51 
I I I I I I 

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGG ACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 1 20 
CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 1 80 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 
AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGG AG AGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAGA TCTATGACAG 540 
CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAG AT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG G AAGGGTTGT GACCGGGAGA CCTACTCTGA 960 
GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TGATCTTCAA 1080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1 140 
TCAGTTCACC ATCACCATGT GGATGAAACA OGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGG A TCAGATTTGT GACAAAGAGT GGCACTACTA 1 380 
TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1 500 
CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 
AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTG ATG GAAGGTGACG ACATTGGG AA 1 740 
CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1 800 
TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1 860 
TATCCCTGAG GTAG ATGCCT ATGTG ATGGT CCTCCAGGCC ATCG AGCCCC GG ATCACCCT 1 920 
CCGGGGCACA GACCACTTCT GG AGACCTGC TGCCCAGTTT G AAAGTGCCA GGGGAGTGAC 1 980 
CCTCTTCCCT G ATATCAAG A TTGTGAGCAC CTTCGCCAAA ACCG AAGCCC CCGGGGACGT 2040 
GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCAACGAC ACCTGGATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATG AGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 
TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 
CGCCCACCAG CACTTCATCC AGGAG ACTG A GGCTGCCAAG GAA7CTG AGA TGGACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 
TGGCTCTGAC GACAGCGAAG AGGAGGAGG A GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTACIAGTG 2880 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 
AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCC T GTAGCCTCC A 3120 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTG AACT TCACACACGT AAGGTCTTAG 3360 
TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 
CCCCTTGCTT CTTTCTG AGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCCA CAAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTG AGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 
CCAGG AGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GG ACAACAAG G ACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAG A 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT G AGAAAATAC ACGCTTTCTG CATGTATTAG 3960 
AAACGCACGA GCTCC ACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4080 
TAG ATAG AGG GCAAATATAT CTG AAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4 1 40 
TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAG AA TTGTTTCTAT AGTAACTGGT 4200 
CTGTG ATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 
ACGAAGCTCT GAG AACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AG AGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAA AAA A AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAAA CAGCTTAAGA AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTG ATTCA TGAGAACAAC 4560 
CTTGTGAAGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCC TTC C TTTGGACCT 4620 
CACAACAAAT CCTGTGAAGT AACTG AGACA TCTGTTGTTA GATACATTTT TGTGATGAGT 4680 
AA ACTGAGGC TTCG 

Seq ID NO: 36 Protein sequence: 
Protein Accession #: NP_07H14.1 

1 II 21 31 41 51 v 

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIETSYHG V1TENNDTVI 60 
LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAWLNKT SGEGRLRAKS P1DCELQKEY 120 
TFUQAYDCG AGPHETAWKK SHKAVVHIQ V KDVNEFAPTF KEPAYKAVVT EGKIYDSILQ 1 80 
VEAIDEDCSP QYSQICNYEI VTTDVPFA1D RNGNIRNTEK LS YDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCG ASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PG VRAEKETI LCNSDKTEMN RHHYALYVHN 420 
CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DKEWHYYVIN VEFPWTLYM DGATYEPYLV 480 
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540 
CKEGLDINSL ESLGQGKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 
LKVSSKVQCF GED VCISIPE VDAYVMVLQA IEPRITLRGT DHFWRPAAQF ESARG VTLFP 660 
DBOVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720 
HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780 
NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSWP SIATVVIIIS 840 
VCMLVFVVAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTTTVNPMEK HEGPGHGEDE 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 

Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession none found 

Coding sequence: 143-874 (underlined sequences correspond to start aud stop codous) 

1 11 21 31 41 51 

GGGAGGG AGA G AGGCGCGCG GGTG AAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 
CGCTGCCCGG CAGCCGGGAG CCAJGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGG AG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AA AG GAG AAA AGGGGGAATG 420 
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTG AATGT TCAGG ACCTC TTCCCATTGA 660 
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AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 
CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAG ATGCT TCTACTGGAT GGAATTCAGT 840 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 
H mil ATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGA TT TCACA CTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1 140 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 
TGTACAATTT GTAAATGTTA AGAA' l 11111 TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 

Seq ID NO: 38 Protein sequence: 
Protein Accession 0: none found 

1 11 21 31 41 SI 

MRPQGPAASP QRLRGUXLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQCPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 1 80 
GSPEMNST1N IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIHEE 240 
LPK 

Seq ID NO: 39 DNA sequence 

Nucleic Acid Accession*: NM_000949 

Coding sequence: 285-2 1 53(underlined sequences correspond to start and stop codons) 

1 11 21 31 41 SI 

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 
ATGTCCTCGT GCAGG AAGTA CTCATCG ACT GATGTGGCAG ACTTTGCTCC CTG ACAAAAC 1 80 
TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT G AACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 
CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAA ACCTG AG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGG AGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 
CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGG TTCA CGC 780 
TCCTGTATGA AATTCGATTA AAACCCG AG A AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAG AAATAC CTTGTCCAGG 900 
TTCGCTGCAA ACCAG ACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 
TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTGGAGA 1 140 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 
ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTG AGGT CATTGAG AAG CCAGAGAATC 1440 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 
TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 
CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTGAAT G AAG C AGGT A AAGATGCTTT AAAATCCTCT CAAACCATTA 1680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 
CTG ACCAG G A TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 1 800 
CCTTGGATTA TGTGGAG ATT CACAAGGTCA ACAAAG ATGG TGCATTATCA TTCCTACCAA 1 860 
AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GAAG AATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 
TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 
ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA GACTG ACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTGATT TGTAG ATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 
AAAAGCACAC TGCTTAGTAT TCTTG AGGG A CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA AAG AAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGG A TTACCCAAAT ATAACCATTT 2640 
AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTG ACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 



Seq ID NO: 40 Protein sequence: 



364 



WO 02/059377 



Protein Accession #: NP 000940. 1 
1 11 21 31 41 51 

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 
SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTY1 MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTUDLK TGWFTLLYEI RLKPEKA AEW 1 80 
EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATF1QIPSDF TMNDTTVW1S 240 
VAVLS AVICL UVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE EIXSALGCQD 300 
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SERCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAXEAP PSLEQNQAEK AXANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession none found, Eos cloned sequence 

Coding sequence: 1 -1 572 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 

ATGACCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTG A AGTGCCTGAT 60 
GGAGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTGAAGTCTT CACCTACGGC 120 
ATCATCAAGA CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGTTTTAA TGAATCCAAT 180 
AGCAGG ATCT CATGG ATAAT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TGAGCAATCG TTTCGGACAC CGTCTGGTAG TGATGTTGGG GGGGCTACTT 300 
GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
ATCGGCATCA TCTCTGGTCT GGGATACTGC TTTAGTTTTC TCCCAACTGT AACCATCCTA 420 
TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGGAGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCGTCA CCGAAAATAG TCATCCAGGA AAATCGGAAA 660 
GAAGCGCAGT ATATGCTTG A AAATGAGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 
GGAGTAG AAC TAACTACCTC ACCTAAAAAT GTGCCTACTC ACACTAACCT GGAACTGGAG 780 
CCG AAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 
GCCCCGCTAT TAGACTTCTC CATTTTGAAA GAGAA AAGTT TTATTTGTTA TGCATTATTT 900 
^^^.^^.q CAACACTGGG a -rrr TTTr,/- a crTTCCTTQT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGG A CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAGAA 1020 
GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTCG TAAGATTTAC 1080 
ATTG AGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1 140 
GAATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGGACTCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAOAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CATTCAG AGC ATAGCAGGAC TGGCTGG ACC GCCCCTTGCA 1320 
GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATG 1560 
GAGCCGGTATGA 

Seq ID NO: 42 Protein sequence: 

Protein Accession U: none found, Eos cloned sequence 

1 11 21 31 41 51 

MTONKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG I1KTFGVFFN DLMDSFNESN 60 
SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLVVMLGGLL VSTGMVAASF SQEVSHMYVA 120 
IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERIGWR 180 
YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240 
GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKIY 360 
IELICVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTFflPLLAED DWGIEKMSS 420 
AAG VYIF1QS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKVV SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV 

Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession »: FGENESH predicted ORF 

Coding sequence: M 749 (underlined sequences correspond to start and stop codons) 

1 U 21 31 41 51 

ATGCTGTCTG GCTTCTTGATGAGTCCCAGT ACCCAGCACA G AGCAC AGTA CACTCCCGG A 60 
GGAAAGAAAC TTCCGTGGG A GGCTTCCATC GGTGCGCACA CCTCCCG AGG GCG AGGCAGC 120 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCTGCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 
GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGG ACTGCGG CTGGGGGCGC GCAGG ACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 

365 



WO 02/059377 



TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1 140 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CXCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 
ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 
AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACG AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1 740 
ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 
TTGCTGATGG GTGTATAACA AATGCTTG AG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 
TTGTTGTTGA CTGCACA GGA TCAAAATGCC TGTTATCTCC CTTTTACTGO OA C1 H 1 U 1 2160 
TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 
TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 
TGAGCCTTTT 1111 J 1 1CTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAG ATCTTA TTACAAAGTG ATTTTCTACA 2700 
TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 
TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 . 
TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
GAGTGCAAGT AATTACTATA CTTGTAAATG AAG ATCAGTA TTTCTGCCTA GATCTGATA A 3060 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCG ATCAT CTTTCTATTC 3540 
TGTTTC ATG A CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 
TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 44 Protein sequence: 

Protein Accession 0: FGENESH predicted 

1 11 21 31 41 51 
i I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASl GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
GEAEKGNRG E PPAW1RAQQQ PRPPPAGQAP GTAAGG AQDP RLRPG RSRGR VRLPVKPPEA 120 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 1 80 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFL1VGS VFVAFIILGS 420 
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPM1PSASTS RGSSSRQSST AASSSSSANS 480 
GARAPPTRSQ TNCCLPEGTM NNVWNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVG YTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKM YPAV TV 

Seq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_002285 

Coding sequence: 55-3738 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 
1 I I I I I 

GGCCGAGCCT CGGCGGCGGC GGTAGCGGCG GCGGCGACGC TGACACCTCC CACCATGGAC 60 
AGCTTCGACT TAGCCCTGCT CCAGGAATGG GACCTCGAGT CACTGTGTGT CTATGAACCA 120 
5 GATAGAAATG CATTACGGAG G AAAGAACGA GAAAG AAGAA ATCAAGAAAC TCAACAGGAT 1 80 
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTG AGC CCTACAAGAC TAACAAGGGG 240 
GATGAACTCT CCAACCGGAT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300 
TTAACTGATA GAACCAATCA GAGTCATCTC GTTGGAGTTC CCAAACCTGG GGTTCCTCAG 360 
ACTCCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAGAGCCCA GAACCAGCCC 420 

1 0 TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480 
AGAGGCACTA TGGGCTGGCA GAAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG G ACCTTGCTT GGAGATGGTG TTGGCAGACA GCAGCCTCGG 600 
GCCAAACAAG TGTGCAATGT GG AGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GCGGCCAAGC ACAGCAGCAG CGGACACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 

15 AAACCCAGCC TGGTCCAGCA GAAACCG ACC GCGTATGTGA GGCCAATGGA CGGCCAAGAT 780 
CAGGCCCCTG ATGAGTCTCC TAAGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCCCCAAGCA GGGGGAGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 
GAAGAAATAA TCCGGGAG AT G ACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020 

20 AAAGTGGAAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGGA 1080 
CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1140 
ACATCAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 

. TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 

25 AGCTCCTCCA GCGACTCAGA GAGCAGCTCC GGATCTGACT CGGAG ACCGA GAGCAGCTCC 1 380 
AGCGAG AGTG AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCGAGGCTGA ACCGGCATCC 1440 
TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAA AGTTA ATCCCCACAA GCCTCCTATT 1 500 
CTG ATCC AAA. ATG AAAGCCA CGGGTCAGAG AGCAATCAGT ACTACAACCC GGTGAAAGAG 1560 

„ GACGTCCAGG ACTGTGGGAA AGTCCCCGAC GTTTGCCAGC CCAGCCTGAG AGAGAAGGAG 1620 

3 0 ATCAAG AGCA CTTGCAAGG A GG AGCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680 
AAAGGCGTGA AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740 
CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800 
GCGCGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
CACCGGCCCG AGG AGCCCGC GGCCGCGG AC GCGCTGGGG A CG AGCGTGGT GGTCCCCCCG 1 920 

3 5 GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 
TCCTCCGTGA CCTGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
AAGGAGTTCA TTGAGACAGA GTCGTCATCT TCATCCTCCT CCTCGGACTC CGACCTGGAG 2100 
TCCG AGCAG G AGG AG TACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCTCCTCC 2160 
GGGAATGATC AGAGGCTGAA GGAGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 

40 GTAGGCTCCA TCAACGCCAG G ACCACCAGT GACATCGCCA AGGAGCTGGA GGAGCAGTTC 2280 
TACACACTGG TCCCCTTTGG CCGG AACGAA CTTCTCTCCC CTCTAAAGG A CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400 
CCCCAGGAGC CAGGGGTATT G AGCGCCCCT GCCACCAAGG ACTCTGAGAG CGCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 

45 TGTGACAACG AAGACGACTA CAGGGAGATC AAGAAGTCCC AGGGAGAGAA AGACAGCTCT 2580 
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAAAAAT GAAAAAATGC TTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAAATACACC AGCGAGGACT TAACTTCTTC CAGCCGACCT 2760 
AATGGCAACA GTTTGTTTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CG ACAGCCAG 2820 

5 0 CTGCAGCCTC ACGGCGG AGA CCTCACGAAA GCAGCTCACA ACAATTCTG A AAACATTCCC 2880 
CTCCACAAGT CACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAGAGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
GAAGCTAAAC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060 
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGCCCC 3 120 

55 ATGGAATCCA AATCTCCTTA TTACCTGATG TATTCAGAAA CAGTAGAGCT CATCAGGTAT 31 80 
GCTATGAG AC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAGAAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCG ATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGGAC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 

r . CAAGCCCCAT CTCCGTGGGG GGCCAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420 

60 CCCAACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTGTCCCCGT CGACCATCGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540 
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACT GGGAGATGGC CGACAACCTG 3600 
GCCAAGG AAA ACCG AGAATT CTTCAACGAC CTGG ATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720 

65 AGCGCCCACC TGTCATAGGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780 
TGGCTCAACG TTTTTGGACA CTGTGCTACT GAAACTCCCA GCCACAGCAT TTATAGACTG 3840 
CGGTGAACAT TTCCTCA 

SeqIDNO: 46 Protein sequence: 
70 Protein Accession #: NPJM2276 

1 11 21 31 41 51 
I I I I I I 

MDSFDLALLQ EWDLESLCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
7 5 KGDELSNR1Q NTLGNYDEMK DFLTDRTNQS HLVGVPKPG V PQTP VNKIDE HF VADSRAQN 120 
QPSSICSTTT STPAAVPVQQ SKRGTMG WQK AGHPPSDGQQ RATQQGSLRT LLGDG VGRQQ 1 80 
PRAKQVCNVE VGLQTQERPP AMAAKHSSSG HCVQNFPPSL ASKPSLVQQK PTAYVRPMDG 240 
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QDQAPDESPK LKSSSETSVH CTSYRGVPAS KPEPARAKAK LSKFSIPKQG EESRSGETNS 300 
CVEEIIREMT WLPPLSAIQA PGKVEPTKFP FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSAWQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK. WLNKVNPHKP 480 
PIUQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEDCSTCKEE QRPRTANKAP 540 
GSKG VKQKSP PAAVAVAVSA AAPPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600 
NCHRPEEPAA ADALGTSVW PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRIVP 660 
KSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720 
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVKl DLTLLSRIPE 780 
HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EJKKSQGEKD 840 
SSSRLATSTS NTLSANHCNM NINSVAIPIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900 
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960 
RDCKRQKXVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020 
GPMESKSPYY LMYSETYELI RYAMRLKTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 
RDHAVKYSKA LIDYFKNSSK AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1140 
SALSPST1VS IPQRIHQMAA NHVSITNSIL HSYDYWEMAD NLAKENREFF NDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession* NMM3151 

Coding sequence: 35 1 -4499(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

ACTGGGATAA AGCAAGAAGA CTGATTTTAT GAGCAGGGGT TTGATACATC AAAGGAGATT 60 
GCCCAGGATC AAGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 1 20 
ACTAAGTGAT TTGGGCCTGA GGCCTGAG AA GATGTTTAAA AAG AGGGATC AAGCACAGGC 1 80 
TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 
GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GGAACTGAAA ATGACTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT G AATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 
GTCAGCAAGA GAGAAATCCT G AGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAG A CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAG AAG AA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 
TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAOTTGGG GAATGTTGTC CATGGAGTGG 960 
GACTCTGCTT TGCCCTTTTT'CTCTCCGAAT GTGTGAAGTC TCTGAGTTTQTCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1 140 
TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GG AGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCCAGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1 560 
TCCACACATC CTTAAAGCTO AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CT CACG AATT 1680 _ 
CCAAGTCTGC AGTGATGAGG TTCAAG AAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 
TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTG A GG AGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGG A ACTTCTGCCC TTTGG AG ACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACG ACCCC CTGTCTGCTG 2340 
TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT T AAGAA GACA CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACO ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AACAAGGTTT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 
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CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3180 
TGTCCTTAAT GGTGATCGCC GTCCTGTTG A TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG G AGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTG AT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 
AGTTCACGGC TGTAG AGAGG ATACTGCAGT ACATGAAGAT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCOCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AG AGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG GCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
TGGACATTTG CAGCATCGGC CTGGAGGACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
AtCCAGTGCT GCTCTCAGGA ACCATCAG AT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
ACCAGCAG AT CTGGG ATGCC TTGG AGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 41 40 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATG AAG 4260 
CCACAGCCTC CATTGACATG GAG ACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 
GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 
TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 
TCCTTTTAAC TTACATGCTG AATAATTTTA TAATA AGGTA AAAGCTTATA GTTTTCTG AT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 
TC 

Seq ID NO: 48 Protein sequence: 
Protein Accession #: NP_1 49 1 63.2 

MTRKRTYWVPNSSGGLV^GIDIGDDMVSGUYKTYTLQ 

MIPFRPKPRFP APQPLDNAG LFSYLTVS WLTPLMIQSLRSRLDENTIPP LSVHDASDKN VQRLHRLWEEE 

VSRRG I EKASV LLVMLRFQRTRLIFDALLG ICFCIASVLGPILIIPK1LEYS EEQLGNVVHG VGLCF A LF 

LSECVKSLSFSSSV/HNQRTAIRFRAAVSSFAFEKXIQI^YIHITSGEAISFFTG 

VUTCAS LV ICSISSYFnG YTAF I AI IXYLLV FPL A VFMTRMA VKAQHHTSEV SDQRIRVTSEVLTCIK 

LIKMYTWEKJFAKIIEDUIRKERKIXEKCGLVQSL^ 

I^SLNIXRLSVFFVPIAVKGLTNSKSAVMRFKKFFLQESPWYVQTI^DPSr^ 

VNGALELERNGHASEGMTRPRDALGPEEEGNSLGPELHKINLVVSKGMMLGVCGNTGSGKSSLLSAILEE 
MHLLEGSVGVQGSLAYVPQQAWIVSGNIRENILMGG 

GLNLSGGQKQRISLARAWSDRQr/LLDDPLSAVDAHVGKHIFEECIKKTLRGKTVVLVTHQLQYLEFCG 
QILLLENGK1CENGTHSELMQKKGKYAQLIQKMHKEATSDMLQDTAKIAEKPKVESQALATSLEESLNGN 
AVPEHQLTQEEEMEEGSLSWRVYHHYIQAAGGYMVSCIIFFFVVUVFLTIFSFWWLSYWLEQGSGTNSS 
RESNGTMADLGhHADWQI^FYQLVYGLNALLlICVGVCS^ 

FDTIPIGRLLNCFAGDLEQLIX^LU'IFSEQFLVLSLMVIAVLUVSVLSPYIIIMGAIIMVICFIYYM^ 

KKAIGVFKRLENYSRSPLFSHILNSLQGL^SIHVYGKTEDFISQF 

EiMTNLVTLAVALFVAFGlSSTPYSFKVMAVhnVLQUV^ 

EAPUiMEGTSCPQGWPQHGEIIFQDYHMKYRDhTIPTVLHGINLTIRGHEWGIVGRTGSGKSSL^ 
LVrTMAGRIUDGVDICSlGLEDLRSrOSVIPQDPVLLSGTIRFNLDPFDRHTDQQIWDALERTFLTK^ 
SKJPKKLHTDWENGGWSVGERQLLCIARAVLRNSKIILrDF^TASroMETDTLIQRTIREAFQGCTVL 
VIAHRVTTVLNCDmLVMGNGKWEFDRPEVLRKKPGSLFAALMATATSSLR 

Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession^: NM 033419 

Coding sequence: 1 8-980 (underlined sequences correspond to start and stop codons) 
J 11 21 31 41 51 

CGAGCCAGGG AGAAAGGATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 
CGGCGCTGGC GAGCGGCTCC CAGGGCG ACC GTGAGCCGGT GTACCGCG AC TGCGT ACTGC 1 20 
AGTGCG AAG A GCAGAACTGC TCTGGGGGCG CTCTG AATCA CTTCCGCTCC CGCCAGCCAA 1 80 
TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACGACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTG ATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 
TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCGACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 
AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGGAAG ATG ACAGCCTGTA CCTGCTGAAG G AATCAG AGG 960 
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ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 
TGAACTTGGA CATG AAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1 140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGGA CTCGAGAGTG 1200 
GGCAGCCCCT CTACCTCCTG G AGCTGAACT GGGGTGGAAC TG AGTGTGCT CTTAGCTCTA 1260 
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TG AAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 
ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560 
GG AGGCCTC A CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTO TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1800 
GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA 1860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAGG ATG AGGGAATCCT 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1980 
AGCCAGG AGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 
GCCCGCCTCCTGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 
TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 
TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGG AGAAGGG GAGAAAGCTT 2280 
ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 
AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 
CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 
TCCCTCTTTG AGGG AGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGGCTG 2580 
CGCTAGCTTC TTTTG ATACT GAAAACTTTT AAGGTGGG AG GGTGGCA AGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 

Scq ID NO: 50 Protein sequence: 
Protein Accession #: NP_2 1 9487. 1 

1 11 21 31 41 51 

1 I I I I 1 

MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 1 80 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 



Seq ID NO: 51 DNA sequence 

Nucleic Acid Accession #: XM JH9098. 1 

Coding sequence: 178-61 8(underfined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

G ATGTACACT CTGAAGTGAG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTGAGC ATCAAGAGAG ATGCCGGGAG 120 
GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGG ACCAGCT GGGTGAGATG 1 80 
TCGTACACCA CAATGTGCAT CAAGGAGACG TGCCGATTG A TTCCTGCAGT CCCGTCCATT 240 
TCCAGAG ATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGG ATC 300 
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGGAA AAACCCAAAG 360 
GTCTTTGACC CCTTG AGGTT CTCTCAGGAG AATTCTGATC AGAGACACCG CTATGCCTAC 420 
TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480 
AAGGTAACCA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GGATGTATTT GCACCTGAAG 600 
AAACTCTCTG AATGTTAGAT CTCAGGGTAC AATGATTA AA CGTACTTTGT TTTTCGAAGT 660 
TAAATTTACA GCTAATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 
GAGGTTGGTG GG ATAGGGGT CTCTGTGAAG AGATCCAAAA TCATTTCTAG GTACACAGTG 780 
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGGAGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AG ACTTTCAT ATATTTTCTG TTGTTTTTAA 900 
AATAGTTTTC AG AATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAA ATCATGTAGA ATAAAAATTT TAAATCTCAC TTCACTTAGC CGACATTCCA 1020 
TGCCCTGACC AATCCTACTG CTTTTCCTAA AAACAGAATA ATTTGGTGTG CATTCTTTCA 1080 
GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTG AT TTTTTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence: 
Protein Accession #: XPJJ59098. 1 

1 U 21 31 41 51 
I I I I I 1 

MSYTTMCIKE TCRLIPAVPS 1SRDLSKPLT FPDGCTLPAG rrWLSIWGL HHNPAVWKNP 60 
KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN C1GQEFAMIE LKVTIALILL HFRVTPDPTR 120 
PLTFPNHFIL KPKNGMYLHL KJCLSEC 
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Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession U: NM_O309 1 6 

Coding sequence: 1-1533 (underlined sequences correspond to start and stop codons) 

5 1 11 21 31 41 51 
I I I I I 1 

ATGCCCCTGT CCCTGGGAGC CGAGATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGG A CGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 1 80 

10 CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCG AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCGG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 

1 5 G AGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
CTGACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGGACC AAAGGATCAC CCACATCCTC 720 
CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAG ACCAAAA TCTGTGGCAC 780 

20 ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTGAGTGAAG GGCAGCCCCC TCCCTCATAC 840 
AACTGGACAC GGCTGGATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTGATCGC CGCACTCTTG 1080 

2 5 TTCTGCCITC TGGTGGTGGT GGTGGTGCTC ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1 140 

CAGATG ACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
GAGGGCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
_ GAGGGCCGCA GTTACTCCAC GCTGACCACG GTGAGGGAGA TAGAAACACA GACTGAACTG 1 3 80 

30 CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 

Seq ID NO: 54 Protein sequence: 

3 J Protein Accession U: NPJ 1 2 1 78. 1 

1 11 21 31 41 51 

! ! ! ! i I 

MPL^LGAEMWGPEAWLUXLLLASFTGRCPAGELETSDVVTVVUjQDAKLPCFYRGDSGE 60 

40 QVGQVAWARV dagegaqela llhskyglhv spayegrveq PPPPRNPLDG SVLLRNAVQA 120 

DEGEYECRVS TFP AGSFQAR LRLRVLVPPL PSLNPGP ALE EGQGLTLAAS CTAEGSPAPS 1 80 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRJTHIL 240 
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360 
45 FCLLVWWL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGKQ 480 
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 55 DNA sequence 
5 0 Nucleic Acid Accession #: AF0071 70. 1 

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

5 5 AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAG AAC CAAGG AAAGC ATGTACCACT CACTGACATA TGCCACCATC 1 80 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGG ACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 

6 0 TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTG A GCTTCATCAA AGGCGGC ATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 
G AGCTGGACA GCCTTGTTCA GTCCTCACAA TACTG CAAGG GTGAGAACCA CCCGCACTTT 540 
G AAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTG A CACTGTCCAT GCTTCCTACT 600 

65 AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 

70 GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TG AAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1 140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 

75 CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGG AGA AAGGCCCAGA GAACGAGTAC 1380 
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TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 
GACAGAAACG AAG AGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGGAGT CAAGG ACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1 680 
CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1 740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGG ATGG CACAACATAG 1 860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 21 60 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAG AAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

Seq ID NO: 56 Protein sequence: 
Protein Accession #: AAC39582. 1 

1 11 21 31 41 51 
I I I I I I 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENMVS 120 
FIKGG1KVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRJLR 180 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMY1 WNGYAVIGK.Q PKLTDGILEI 420 
mCAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480 
UPMALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFR1QAAT LQAKSSLENS 540 
SRSMVSSVSL 

Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession ft: NM_006670. 1 

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I 1 I I 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 1 20 
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 1 80 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 

Seq ID NO: 58 Protein sequence 
Protein Accession 0: NP.006661.1 



1 11 21 31 41 51 
I I I I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTY VSF RNLTHLESLH LEDNALKVLH NGTLAELQG L PH1RVFLDNN 1 80 
PWVCDCHMAD MVTWLK.ETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 
SYYFLGIVLA UGADFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLB 



Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession S: NM_024022 
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Coding sequence: l-1362(underlined sequences correspond to start and stopcodons) 
1 11 21 31 41 51 

5 ATGGGGG AAA ATGATCCGCC TGCTQTTGAA GCCCCCTTCT CATTCCOATC GCTTTTTGGC 60 
m^TC A^TG^^AAG TCCTGTTGCA CCAG ATGCAG ATGCTGTTGC TGCACAGATC 120 
rTrrrlrTfiC TGCCATTGAA GTTTTTTCCA ATCATCGTCA TTGGG ATCAT TGCATTGATA 180 

SagcIcTggccaSctgg^ ™ 

TCATOCT^A^STt^ATCGA GCTGATAGCT CGATGTGACG GAGTCTCGGA TTGCAAAGAC 300 
10 G^GG^^GACG^AGTACCGCTG TGTCCGGGTG GGTGGTCAG A ATGCCGTGCT ^ A GGTGTTC 

ACAGCTOCnr OTTCGAAGAC CATGTGCTCC GATGACTGG A AGGGTCACTA CGCAAATGTT 420 
GCCTCTGCCC AACTGGGTTT CCCAAGCTAT GTG AGTTCAG ATAACCTCAG AGTGAGCTCG 480 
CTGG AGGG GC^TTTCCG G G A GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATGACAAG 540 

Sgactgca^tacS SS*£™S" S 

1 5 ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT (^jTGGGTCGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA ^G^ACCAC 720 
^GTGCGGGG GCTCTGTCAT CACGCCCCTG TGGATCATCA CTGCTGCACA CTGTGTTTAT 780 
G^C^GTAC^^^XCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 
rcA^CCCC^T CCCACTTGGT GGAGAAGATT GTCTACCACA GCAAGTACAA G ^^^^f?9 Q 9 ™ 

20 CTGGGCAATG ACMTCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TGAAATGATC 960 
^^f^r?GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGGAAAAGT GTGCTGGACG 1020 

tcaSggggS^^ 

Src^^GA *mCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1140 
C^CTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGACAGC 1200 
? 5 r^rr CCCC^GTOrGTC A , AGAGAGGAGG CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1260 

25 1320 
G ACTGG ATCC ACGAGCAGAT GG AG AG AG AC CTAAAAACCTGA 

Seq ID NO: 60 Protein sequence 
3 0 Protein Accession #: NPJJ76927 

1 11 21 31 41 51 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP irVIGUALI 60 
1 S i^T^LGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 
^W^C^Sg^At^CAQLGFPSY VSSDNLRVSSLEGQFREEFV SIDWJLPDDK 180 
VTAi hSvYV REGCASG HVV TLQCT ACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 
^GGSVI^pI X5SS5^DLYLPKSWT1 QVULVSLLDN PAFSHLVIM VYHSKYKPKR 300 

MI^L AGpi™ SGWGATEDGG DASPVLNHAA 360 

40 ESK^^ 420 

GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 61 DNA sequence 
Nucleic Acid Accession H: NM_006475 
45 Coding sequence: 28-2538 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

L ^}mXvr ri AraGAGAG ACTCAAGATG ATTCCCTTTT TACCC ATGTT TTCTCT ACTA 60 

50 TOCTG^T^TT^^ 

a rrrrTiTPA rtfinr.TCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA G ATTTTGGGC 1 80 
a^^II A aatAOTCAG CAOTCTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 

r^rSn^A^S^TO^TATTOGAO GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
CGAGACAAAO TGGCTTCCGA AGCTOTATG ^CT^CACA TCTTAAATAC ^™£T ^ 

/*ty^ap a a at^ ATrGTGTG AT CCATTTX3ATT G ATCAGGTCC TAATTCCTGA TTCTGCCAAA 1140 
?AArTTATT^AGCTGG^GG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
rrTTTGGCAT CTGCTCTCAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTG AATAAT 1260 
™ ^SSS^r SnSScTC^cTTGOTrCAGCGCCTCCTTAAATTAATTCTGCAGAAT 1320 
70 SrlTISl^CTA^A^ "SmT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 
AT^GAGGCA^^A^^TCAG AGT^TCGTA ^^^^I^I^^^^^^^^jI^qq^^^^C^qq 
TGCATGGAGA AAGGGAGTAA GCAAGGGAG A AA^^GTGCGA TTCACATATT <~^°™g? ™ 
A-rrAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 15«J 
7S ^CCTCA^3CC TACTTG AAGC TGC AG ACTTG AAAG AGCTCC TG ACACAACC TGGAGACTGG 1620 
75 I^TATTTG TGCCAACCAA TGATGCTTTT AAGGOAATGA CTAGTGAAGA AAAAGAAATT 1680 
CT^G^ACG^G^CA^V^VVr^^^TTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
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TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1 980 
5 CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTG ATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 

1 0 G AACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAG AA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 

1 5 AATAACCTGA CCTTAG AAAA TTGTGAGAGC CAAGTTG ACT TCAGGAACTG AAACATCAGC 2640 
ACAAAGAAGC AATCATCAAA TAATTCTGA A CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 

20 CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA AC Ml 11 ATA 3000 
TCAAAAGGCT TTGCACATTT CTATATG AGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 
TCTCAAACGT TTCAATAAAA CCATTTTTCA G ATATAAAGA GAATTACTTC AAATTGAGTA 3 1 80 

25 ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 62 Protein sequence: 
Protein Accession^: NP 006466 

30 1 11 21 31 41 51 
I 1 I I I I 

MIPFLPMFSL LLLLIVNPIN ANNHYDK1LA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 
KNWYKKSICG QKTTVLYECC PG YMRMEGMK GCPA VLPIDH VYGTLGI VG A TTTQRYSD AS 1 20 

_ c KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180 

3 5 NGMIIPSMYN NLGLFINHYP NGVVTVNCAR UHGNQIATN GWHVIDRVL TQIGTSIQDF 240 
IEAEDDLSSF RAAAITSD1L EALGRDGHFT LFAFTNEAFE KLPRGVLERF MGDKVASEAL 300 
MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 
IDQVLtPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
. VQRLLKLILQ NHIUCVKVGL NELYNGQILE TIGGKQLRVF VYRTAVC1EN SCMEKGSKQG 480 

40 RNGAIHIFRE IQCPAEKSLH EKUCQDKRFS TFLSLLEAAD LK.ELLTQPGD WTLFVPTNDA 540 
FKGMTSEEKE IURDKNALQ NTILYHLTPG VFIGKGFEPG VTNILKTTQG SKDFLKEVND 600 
TLLVNELKSK ESDIMTTNGV IHWDKLLYP ADTPVGNDQL LEILNKLDCY IQKFVRGST 660 
FKEIPVTVYT TKHTKWEP KIKVIEGSLQ PUKTEGPTL TKVKIEGEPE FRUKEGETI 720 
TEVIHGEPH KJCYTKIIDGV PVEITEKFTR EERHTGPEI KYTRJSTGGG ETEETLKKLL 780 

45 QEEVTKVTKF IEGGDGHLFE DEE1KRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NO: 63 DNA sequence 
Nucleic Acid Accession #: NM 020974 

^ ^ Coding sequence: 8 1-3080 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 SI 
I I I I I I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
- CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 

5 5 CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 1 80 

CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CG ATGAATGT GGAAATG AGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

60 TTGATGGCTT CATGTTGGCT CATGAOGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGG AGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTO CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
_ GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

65 TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

70 TCCAGTTGGA TGGGAAGACA TGTAAAG ATA TTG ATGAGTG CCAGACCCGC AATGGAGGTT 1 080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATG A GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAG ACA CCA ATGAGTG CAGCATCAAC AACGGAGGCT 1 320 

75 GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1 380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 
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GTCACTCTGG C ATTCACCTC TCTTCAG ATG TCACCACCAT CAGG ACAAGT GTAACCTTTA 1 560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
5 TTATCACTGT TG AGTTTG AG CTTG AAACTA ACCAAAAGG A GGTG ACAGCT TCTTGTG ACC 1 800 
TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GG ACCTATTA TGATGGAGCA CGAGAACGCT 2040 

1 0 GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTG AATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCACCAAACATCAG GGAGCTACTT(XTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

1 5 GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TG AATTTGG A AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAA AC AG AAGATGTGGA GGGGAGCTGG 2520 
GAG ATTTCAC TGCGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTG ATCGTG GTCCCTGAGA 2640 

20 TCTTCCTGCC CATAG AGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 
CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 

25 TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCG AGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 

30 GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGG ATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTG AGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTG ACCAG GCAGAACAGG 3540 

3 5 CAAGAGGGGA GGGAAGGAG A CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG i 3600 
ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AA A A A A AAA A GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

40 *>q m NO: 64 Protein sequence: 

Protein Accession NP_066025. 1 

1 11 21 31 41 51 

45 MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SONQHTCIHR SEEGLSCMNK 180 
DHGCSHiaCE APRGSVACEC RPGFELAKNQ RDCILTCKHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 

5 0 STG VHCSCPV GFTLQLDGKT CKD1DECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCODVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTT1RTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SS VKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 

5 5 FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDG A RERCILCPNG 660 
TFONEEGQMT CEPCPRPGNS G AUCTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 
PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 

60 IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 

Seq ID NO: 65 DNA sequence 
Nucleic Acid Accession H: NM JW72 1 0 

65 Codiug sequence: 1-1869 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 

ATG AGGCTCC TCCGCAG ACG CCACATGCCC CTGCGCCTGG CCATGGTGGG CTGCGCCTTT 60 
70 GTGCTCTTCC TCTTCCTCCT GCATAGGG AT GTGAGCAGCA GAGAGGAGGC CACAGAGAAG 120 
/U cJGTGGCTGAAGTCCCTGGT GAGCCGGAAG GATCACGTCCTGGACCTCAT GCTGGAGGCC 180 
ATGAACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAG A AGCCCAGCAG l 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGGTTCT ATACCCCAGC TG AACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
75 CAGAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG A AGAAGGCTA TAAG AAGC AC 420 
TGTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAG A GOT^^GG GCC^ACACC 480 
CGACCACCTG AGTGTGTGG A CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
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GTGATCATTG TGTTCCACAA CGAAGCCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600 
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCACC TAAAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
CAGGCGGAGG TGCTCACGTT CCTGGATGCC CACTGTGAGT GCITCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCG A 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGAAACAC TTCCTCCACA TGAGAAGCAG 1020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 
ATCCCCAAGT CCTACTTTGA GCACATCGGT ACCTATGATA ATCAGATGGA GATCTGGGGA 1 140 
GGGGAGAACO TGGAAATGTC CTTCCGGGTG TGGCAGTGTG GGGGCCAGCT GGAGATCATC 1200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC CCCACACCTT CCCCAAGGGC 1260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1 320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAGATGG CCCAAGAGAA ATCCTTCGGT 1380 
GACATTTCGG AACGACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1440 
CTGCACAATG TCTACCCAGA GATGTTTGTT CCTGACCTGA CGCCCACCTT CTATGGTGCC 1500 
ATCAAG AACC TCGGCACCAA CCAATGCCTG GATGTGGGTG AG AACAACCG CGGGGGG AAG 1 560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1680 
CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGGAA 1740 
TGGGAATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCCCCATCA GTTGTGGCTC 1 860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCGTAAAGA TGGATTTCTA 1980 
AACCCACTGG GTGGCAAGGC AGGACCTTCC TAATCCTTGC AACAACATTG GGCCCATTTT 2040 
CTTTCCTTCA CACCGATGG A AG AG ACCATT AGG ACATATA TTTAGCCTAG CGTTTTCCTG 2 1 00 
TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG G AG AATGGCT TG AGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280 
AACATAGTGA GGATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGGAGG TTG AAGCTAC AGTGAGCCTT GATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA AAAAAGAAGA AGAAAAGTAC TTCTACAGCC 2520 
ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATGA TATATTCTGG 2580 
CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG GAGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 
A 

Seq ID NO: 66 Protein sequence: 
Prote in Accession #: NP_009 1 4 1 

1 11 21 31 41 SI 
11)111 

MRLLRRRHMP LRLAMVGCAF VLFLFLLHRD VSSREEATEK PWLKSLVSRK DHVLDLMLEA 60 
MNNLRDSMPK LQ1RAPEAQQ TLFS1NQSCL PGFYTPAELK PFWERPPQDP NAPGADGKAP 120 
QKSKWTPLET QEKEEGYKKH CFNAFASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 1 80 
VUVFHNEAW STLLRTVYSV LHTTPAILLK EDLVDDAST EEHLKEKLEQ YVKQLQWRV 240 
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECf HGWLE PLLARIAEDK TV WSPDIVT 300 
IDLNTFEFAK PVQRGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYPDC SPTFAGGLFS 360 
IPKSYFEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLEI1 PCSWGHVFR TKSPHTFPKG 420 
TSVIARNQVR LAEVWMDSYK KJFYRRNLQA AKMAQEKSFG DISERLQLRE QLHCHNFSWY 480 
LHNVYPEMFV PDLTPTFYGA IKNLGTNQCL DVGENNRGGK PLIMYSCHGL GGNQYFEYTT 540 
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTG KNSQVPKDEE WELAQDQUR NSGSGTCLTS 600 
QDKKPAMAPC NPSDPHQLWL FV 

Scq ID NO: 67 DNA sequence 

Nucleic Acid Accession #: NM01 41 12 

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I 1 I I I I 

TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTG TCTTGAAACT GACGTAATGC 60 
GCGGAGACTG AGGTCCTGAC AAGCGATAAC ATTTCTGATA AAG ACCCGAT CTTACTGCAA 1 20 
TCTCTAGCGT CCTCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 1 80 
CTCTCTCGCC TTCCTATTTC Til 1111111 TTTTAAACAA AAAACAACAC CCCCTCCCCT 240 
CTCCCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCT CTCTC 300 
TCTCTTTTTT AATAAGGGTG GGGGAGGG AA AGGGGGGGG A GGCAGGAAAG ACCTTTTTCT 360 
CTCCCCCCCG CAATAATCCA AGATCAACTC TGCAAACAAC AGAAG ACGGT TCATGGCTTT 420 
OGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTG ACGATTA ATCAACAGAT 480 
GTACAGATCA GCTCTCAA AA TGTCTTCTGT GTCTTCTG AG CGTCTTCTAA GACAATTGCA 540 
TTAGCCTCCT GCTAGTTGAC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600 
TGCCTTATGA AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660 
CTCTGAGAAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GGTACAGAAA 720 
GCA AGGTATC TGGAAAGAAC AAAGAATTCT CTGCAGATCA GATGTCAGAA AATACGGATC 780 
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AGAGTGATGC TGCAGAACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT 840 
CTTCTAGCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTCCTTTCC GCATGATGAG GTGACAGACA 960 
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTGTGAGCCC TTGAAGTCTC 1020 
5 CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1 140 
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1 200 
AAGTGCCTTC AGATGGGGGT GTAAGACTGA ATAAATCCAA AACTGACTTA CTGGTGAATG 1260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320 

1 0 GATATGGTTA CTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTGGA CAGCAAAATC TTGGCCCTTC 1440 
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGTTTT 1 500 
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATG ATG 1 560 
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1620 

15 GGAACACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1630 
CCGAATT AGA ACAACATTTT CTTCAG ACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740 
CCTCTGAGGT TGCAAAACCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT GCACTTCAAT 1800 
CCAGTGATTC TGGAGACTTG GGAAAATGGC AGGACAAGAT AACAGTCAAA GCAGGAGATG 1860 
ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAGA CAAAATGGTA 1920 

20 CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTG AG TCATCTAGCT 1980 
CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 
ATCCAGAGTT AAATG ATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCTAGCCA 2100 
AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 
AGGACTTCTC CAGCAAGGGA GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220 

2 5 GTGACTTCCG ATATTCC AAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 
CCAGAGG ACT TTGCAGCCCA GAAAAGCACC TTGGAGAAAT TACTTATCCG TTTGCTTGTA 2400 
GAAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
GAAGCTCGCG AGTCAAACAT CAGTGCCATC AGTGTTCATT CACCACCCCT GACGTAG ATG 2520 

30 TACTCCTCTT TCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
CAAATCACCT GCAAGGATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 
GTACC AAATG TGATTTTATT ACCCAAGTGG AAGAAGAGAT TTCCCGACAC TACAGGAGAG 2700 
CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
TGGAGCACTT CAACACTGTT CACTGCCAGG A ACAGGACAT CACTACAGCC AACGGCGAAG 2820 

3 5 AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTGACTTC AGGGTCTACA 2880 
ATCTGCTAAC TCCAG ACTCT AAAATGGGAG AGCCAGTTTC TGAGAGTGTG GTGAAGAGAG 2940 
AGAAGCTGGA AG AG AAGGAC GGGCTCAAAG AG AAAGTTTG GACCGAG AGT TCCAGTGATG 3000 
ACCTTCGCAA TGTGACTTGG AGA.GGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060 
AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120 

40 GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTGGCTGTGG 3180 
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GAAAACAAGT CCAAGG ATGA ATCCCAGTCC CTGTTACGGA 3300 
GGCGTAGAGG CTCCGGTGTT TTTTGTGCCA ATTGCCTGAC CACAAAGACC TCTCTCTGGC 3360 
GAAAGAATGC AAATGGCGGA TATGTATGCA ACGCGTGTGG CCTCTACCAG AAGCTTCACT 3420 

45 CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480 
G AACAAGAAA GCGCCTTAAC CCAGAGGCAC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600 
TAACTGAAAG TCACCAG AGA GAAATTCCAC TCCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660 
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 

5 0 TTCACA AAAG GATGCAACCT TTGCACATTC AG ATAAAAAG TCCTCAGGAA AGTACTGG AG 3780 
ATCCAGGAAA TAGTTCATCC GTATCTG AAG GGAAAGGAAG TTCTGAG AGA GGCAGTCCTA 3840 
TAGAAAAGTA CATG AG ACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3900 
AAAAGTACCA GTACCCACTT TTTGG ACTTC CCTTTGTACA TAATG ACTTC CAGAGTGAAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGCACTACT 4020 

5 5 TGAGTCACGT GCCTGGCCTA CC AAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTTCA 4080 
ATCTGCCTCC TCATTTTTCA GCTGTTGGAT CAGACAATG A CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAGACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAG AAA GTTGATAGAA 4260 
GTACTCAAG A TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGG ATGAAG 4320 

60 TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380 
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAG AGG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGGAA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTG ATG GG AATTCAAT AGCTTGTA AT GTCTTATGAA 4560 
GACCTATTAA AAAAATACTT CATAG AGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620 

65 GTTATTCTTT CTTTTGATGA GTAGGTTACC AAG ATTAAAA AGTGAGATAA ATGGTCAATG 4680 
AG AAAG AATG G A AGATGGTA AACAATCACT TTTTAAAACC TGTTAAGTCA AAACCATCTT 4740 
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 4800 
TTATAAAGAA AGAGACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
AAGCAGTACT GG ACTGGCCA TTGG ACCATT TGTTCCAAAA CCC ATAAATT GTTGCCTAAA 4920 

70 TTTATAATGA TCATG AAACC CTAGGCAGAG GAGG AG AAAT TGAAGGTCCA GGGCAATGAA 4980 
AG AAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG CCATGTTTCA GATTTTG ACC 5040 
TAGAAATGCG AGCTGTGGTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATGGTG 5100 ' 
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTG ATATTTT 5 1 60 
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAG A TTCATTTTGT 5220 
75 CCACACATGT ACCCATTTTA AAAAACAATG TCCTCGATGC TTCTGTAGTG ATTTCATTTT 5280 
AGCCAGGTAT TTCTTTCTTG TGTGTGATGA ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TGTTGGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 
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ATAGATGGAT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAGAACACAT GGCAUl'CTT 5460 
CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520 
GTGGTCAGGC AGCGTCTAAG GTTACGTTAC CACAGACTGA CAGTTGGTAT ATGTACCAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
GTATGTCCTC ATAGAACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 
GCTCCTCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTGAACATTA AATAATTGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820 
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT TTTTTGCCAC GGAGATGTAG 5880 
AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940 
AGAACAAATA GGAAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6000 
ATAATGGTGC CTGACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 
TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGCCTAGCT GTTTACATTT GCAAATGTAG 6120 
CAAACAAGGT AATGAAGCAA CTATTTCTAT TGCAGTAGAT ATCCTTTTGT GTGTGTGTGT 6180 
GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 
GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 6300 
ATTCATTCCA AGTACTTTTT TTTTTTTAAT TTTTAAGCTC TTAACTCACA TTGTTATGCT 6360 
TAAGATGATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATGTTTCATA TGAAACATTT 6420 
CAGAAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACGC TGATTTTTTT TTGCATTCTG 6480 
TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540 
TTGGGTTTTG TTTCTTTTTC ACAGTGCCOG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600 
AGAGTTCAAC CAGTTCGTGA CTGTTGTAGC G AATGAAGTT AAA AAAATGT CTTTCTGATG 6660 
TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGCAT ATTAAAAAAA G AGAAAAGAG 6720 
AAAGCAAG AG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 
TATTCTGATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAA GAAG TGG 6840 
AATGAGGAGC TTTG ACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900 
GGATGCATCT CAAAATGTAT AGCCAGACTT GAGAGGTGAC AATTAAAGAT CTAAAAAAGA 6960 
GAGGAGATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATGCA 7020 
TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGGACTGCT TAAATCAGCA AAACACCAGA 7080 
AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACTTTTT GGGCAAAACT ACTC ATTAA G 7140 
CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 
ATTTTCATTC AATTTTTTTT TTTTCTCAGA CAGACATAGT AGTATCAACT AGCATTGGAA 7260 
AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320 
GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTA TTTT 7380 
CATTTTTTGC TTTCATTATT ATACATATTT TGGTGGAGAA GAGGTTGGGC TTTTTTGAAA 7440 
GAGACAAAAA TTTATTATAA CACTAAACAC TCCTTTTTTG ACATATTAAA GCC TTTAT TC 7500 
CATCTCTCAA GATATATTAT AAAATTTATT TTTTTAATTT AAGATTTCTG AATTATTTTA 7560 
TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGGAA CTTTTTTTAA TGAGGAATTT 7620 
CATGATGATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTG ATGA AAATGATGTG 7680 
CCAGCTAAAA TTGTGTGCCA TTTAAAAACT GAAAATATTT TAAAATTATT TGTCTATATT 7740 
CTAAATTGAG CTTTGGATCA AACTTTAGGC CAGGACCAGC TCATGCGTTC TCATTCTTCC 7800 
TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGGATAGAAA 7860 
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAGAGACAC TACATGACTC 7920 
CAAGTATATG AGAAA AGGAC AG AGCTCTAA TTGATAACTC TGTAGTTCAA AAGGAAAAGA 7980 
GTATGCCCAA TTCTCTCTAC ATGACATATT G AG ATTTTTT TTAATCAACT TTTAAGATAG 8040 
TGATGTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGGTAG ATTTTTATAA AACAAGCATG 8100 
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160 
GTTGAAGCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGTGCACATA ATCTATTATG 8220 
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTG AGG 8280 
AAATACATTT TTCTTAATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 
CTCCTGTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTGAGACCT GTGAACTGCT 8400 
GCCCTGTTTA GTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CATCCAGTGG 8460 
TGGAAATGTG G AGTAAAG AA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 
TTAAGTTGAG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGGCAAAAAA 8580 
AAAAAAAGAA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTOAC 8640 
TGTCCAAGGC ACAAGAGAAA ATTGAGGAAT TGAAATGCAA CCTGAGTATC AAACTAAATA 8700 
TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA GCAG GCAAC T GCAAATGAGA 8760 
AGAAGATAGA AGGACGCCCG TCGGGACTTT GG AGGGCATT GTTATTTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGGTTT TTCAGTACTG 8880 
TTTCATAGAC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC 8940 
AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATGAAGA AAATGTGAAA 7TCAAG AGTA ATCCACACGT GAG AAACTAG ACAATGTACA 9060 
TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120 
GAAAAGCAGG AATAACTTTA CCGTGGAAAT AATGTTTAGC TTTTATCAG A GAAAATTG TC 9180 
CTTCTAG AGC ATAGAGTCCC AAAACTCAAT TCTGGTTTTC CCCTGTTTTT 1111111111 9240 
TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 
CCTGTAAAAT TG AAAAATAT ATGTATTAAT AATATTATTA ATAATAATAA TGGTAATGTA 9360 
GTACTTGTTT GT A A AGC ACT TTGAGATCCT TGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTG 9480 
GGGTCATTTT TCTTAAAGAC GAGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 
TGAAGATGCA AAACTATTAC TAGGCTGATA AAATCATAGT TTCTTAATGG CTACCAATAA 9600 
GGCAAATATC ACAATAATAA ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 
GAAAACTTTG GGGGAGGCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGGA ATGATTTCAT GTTGCTAAGA 9780 
AAAAGAATTC AATTTGTAGT CCTGATTTGA ATACTAGAAT GTTGGCTATA ATAGTTCTGT 9840 
TCTTACAACA CATG AAATTT TTTCGTTTTA TTTTATTTTG TTTTCATAGT GCATGTTCAT 9900 
TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A 
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Seq ID NO: 68 Protein sequence: 
Protein Accession #: NP_05483 1 

1 11 21 31 41 51 

MPYEVNAGYD FTNM VRKKNP PLRNVASEGE GQILEPIGTE SKVSGKNKEF SADQMSENTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YESPSKGGNF PSFPHDEVTD 120 
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 180 
QSGQANCQGL SP VSVASKNP QVPSDGG VRL NKSKTDLLVN DNPDPAPLSP ELQDFKCNIC 240 
GYGYYGNDPT DLDCHFRKYH LGLHNRTRQD AELDSKILAL HNMVQFSHSK DFQKVNRSVF 300 
SGVLQDINSS RPVLLNGTYD VQVTSGGTFI GIGRKTPDCQ GNTKYFRCKF CNFTYMGNSS 360 
TELEQHFLQT HPNK1KASLP SSEVAKPSEK NSNKSIPALQ SSDSGDLGKW QDKITVKAGD 420 
DTPVGYSVPl KPLDSSRQNG TEATSYYWCK FCSFSCESSS SLKLLEHYGK QHGAVQSGGL 480 
NPELNDKLSR GSVINQNDLA KSSEGETMTK TDKSSSGAKK KDFSSKGAED NMVTSYNCQF 540 
CDFRYSKSHG PDVIVVGPLL RHYQQLHNIH KCTIKHCPFC PRGLCSPEKH LGEITYPFAC 600 
RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
ANHLQGSDGQ QSVKESKEHS CTKCDFITQV EEEISRHYRR AHSCYKCRQC SFTAADTQSL 720 
LEHFNTVHCQ EQDrTTANGE EDGHAISTIK EEPKIDFRVY NLLTPDSKMG EPVSES WKR 780 
EKLEEKDGLK EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPIYGLAV ETKGFLQG AP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSGVFCA NCLTTKTSLW RKNANGG YVC NACGLYQKLH STPRPLNUK QNNGEQIIRR 960 
RTRKRLNPEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIP LPSLSKYEAQ 1020 
GSLTKSHSAQ QPVLVSQTLD [HKRMQPLHI QKSPQESTG DPGNSSSVSE GKGSSERGSP 1080 
IEKYMRPAKH PNYSPPGSPI EK.YQYPLFGL PFVHNDFQSE ADWLRFWSKY KLSVPGNPHY 1 140 
LSHVPGLPNP CQNYVPYPTF NLPPHFSAVG SDNDIPLDLA HCHSRPGPTA NGASKEKTKA 1200 
PPNVKNEGPL NWKTEKVDR STQDELSTKC VHCGIVFLDE VMYALHMSCH GDSGPFQCSI 1260 
CQHLCTDKYD FTTHIQRGLH RNNAQVEKNG KPKE 

Seq ID NO: 69 DNA sequence 

Nucleic Acid Accession Ui XM 073879 

Coding sequence: l-387{underlined sequences correspond to start and stop codoos) 
I 11 21 31 41 51 

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60 
GAAGTGAAAA TGTTTGGAGC CAGTCAAGGT TTGCTGACAA TGGAAACAAA CCAGTCCCTG 120 
GCACA AGGCA C AGGCTGCTC AGTAGTAAAG GTAG ACACTG TTCTCTTCG A GAGTTT ATAC 1 80 
CACTGCGGCT TTGAACACGG GAGCGTGATG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 
GACAGGAAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCGACTCCT 300 
GTCTGTCCTG CACACCACTC AACACAG AGC ATCTGCCAGT TTCTACAGCA CTGCAGGCAG 360 
AACACTCACT TGCAGGCTGC TAACTAA 

Scq ID NO: 70 Protein sequence: 
Protein Accession #: XP J>73 879 

1 11 21 31 41 51 

MGFGDQGTVE GSLGTSKKPP EVKMFG ASQG LLTMETNQSL AQGTGCSWK VDTVLFESLY 60 
HCGFEHGSVM HCUjDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTHLQAAN 

Seq ID NO: 71 DNA sequence 

Nucleic Acid Accession »: AB033064 

Coding sequence: 826-1986 (underlined sequences correspond to start and stop codons) 
I II 21 31 41 51 

GGGGAACAGC AAATTCAGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CTTTTCTCTT 60 
CCTGCCTTGC CCCCCCGCAG GGGGTAAGGA ACTGAGCGTT TAATCTTTAG CCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGGATAG CTTTCTTATT TTGCCCATGT 180 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGGA 240 
G AGTGACAAT AAATTAGCCC CTTCTTTTTT CCCTTGTCAT TCAGGCCCCT TTTCCTCTCC 300 
AGAGGG AAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 
GAAACACACT AAAGTGGACA TTATTTGACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420 
TTTGTTGTGG CACATGG AG A AAGATCTCTT TTAAAATATC TCCCAATTAC CCTTTTCACA 480 
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCAAAGCTC 540 
CCCTCTCTAT GAGCTGAAAG ACTGACCAAC CATGAACACT AGTAG GGGAT GGGGAAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660 
TCTGGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTG AA GAGAG AGAAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 780 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAG ATGTG TAGTTCTGTG 840 
GCTGCCAAGT TGTGGTTTTT GACAGATCGT CGCATCAGGG AAGACTATCC TCAAAAAGAG 900 
ATTTTACGAG CATTGAAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG GGCTGTGGTG 960 
ATGGATGAGG TGGTGCTGAC AATCGAGCAA GGAAACCTGG GTCTGCGG AT CAATGG AGAG 1020 
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1080 
GATAGTGACA TCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1 140 
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CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200 
GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACG AAAATTTTGC TAAAATGATT 1260 
GATGAGGCTG AAGTTCTGGA GTTCCCAATG GTAGTAAAGA ATACGCGGGG TCACAGAGGT 1320 
AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 1380 
CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAAAGAGT CTCATGGACG GGATGTACGT 1440 
GTCATTGTCG TGGGAGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGGGAGA 1500 
ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 
AAGCAGCTAG CTATCCAGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAG ATG ACGGCTC .CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 
GCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1740 
CTTCTACCCT CTGGCCGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTOCACTGCC 1800 
AGTGAGACTA GTGAGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860 
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980 
GACTGACTCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCTTTTCT 2040 
ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGAGGAAC 2280 
AAATACTTTT TTTTTTTCAT GGTCCCTTGC TTTTGTTTTT GTACAAAAAA AAATGGTTTT 2340 
GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGTC ATCAGTGAGG 2400 
AAAACATCTG CTTAAATTAC AGGAATTTTT GTATTATACA GCTCTGA AAA TTCTGCCATT 2460 
TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAA AT CTTGAGGGGC TCTTTTACTG 2520 
GGATTTCTTA TTTTTTTGTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580 
CCCAATAAAG GCTTCTTAAT GACAAAATTG GCATGTTTGC ATGATG AAAT GGA AATG AAC 2640 
AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TGTAGATAAA ATTACACTAG 2700 
TTTAAAATAT GTGCATTCAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAGATGTGC 2760 
TCTGTTAATG TTGCTTTTTT TTTTTTTTTT TAATAC ATGC TAGTCTAACA TTTCCTGCTC 2820 
TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAAAATG CTACCTTTTT TGTTAACAAG 2880 
ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 
TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTTTTGGGG GCAAATCAAG AGCCT ATGAG 3000 
TTCTAAGTAT AAAGCTGAAG TGATTTCGAA TGCCAGCGTT ATATATTTGC ATTTTTCACA 3060 
TTTTACGAGG GAGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3120 
TTGTTTCCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GGAGCATGAT GGGAAACAGA 3180 
GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGA CTCCA CT GGGGTAGAGG 3240 
TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300 
GTTGTTTGTT TTTTTCCTTT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 
ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG GATTTGGGAT 3420 
TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTG AG TCTCTTATAG 3480 
CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540 
AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600 
GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACGAATC CTCTGAAGCT TTTACCCA AG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAG A GGTTTGATCA 3840 
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGGAAT GTAATAAAAT TTGTTTTTTC 3900 
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960 
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020 
TAATTTATCT TAGGATATTC TAATTGCATT TAAAAG AACT TATCTTGCGC AGGGTAAATG 4080 
GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATG ACCAT ACTGTGGAGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
GAGGAAGAAA GAACTGCTTA ATTAAATTAT CATTCATATG TTCATATAG A G ACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTTT G AATAGTTAC ATATC ACAAG TATGTAGTTC 4320 
ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380 
ACCAAATTAA CCAACTATAT TATAGGAAAT ATGTGAAATT AGTTCATTAG CTTTATTCAC 4440 
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500 
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCTTCAAG AT TGAGGTAGAG 4560 
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620 
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 
AGAATGTATA TATCA AATAT GTGTAATGAT AAAATCTGAA TTGTAAA ATT T TTGTA TATT 4740 
GTTAAAATTG TAATTCTAAA TTGTATTTCA A A AATG ATT A TTTCTGATAT TGTTTTTATG 4800 
TCACCCATGA TG AAAACTGG ACTTTATATA TCTAAACATA CAAGTATGAA CTATTCTATT 4860 
TAAAATTTTT AATAGTTTTT TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCTTTTCTCC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTA CCCT 4980 
GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AGAACATATT GAAGGCATTG ACTTTGAAAA TCATCTCTTT TTCTCAAGAA GAAAGCAATG 5100 
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG G GCTTGTGG G GCTTAGAGAC 5160 
ATTGTGAAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT GAGAAGGTTT 5220 
ATGGGCTATT TGGCTGGTGA GACACGATCC CCTCCTAAGA AAATGTAGGT GCTCAGACAG 5280 
GTAACCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTTC AATTTTATTT AAGATTTGTT 5340 
TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG GATTTATA AC CAG 

Scq ID NO: 72 Protein sequence: 
Protein Accession #: BAA86552 



1 11 21 31 41 51 
I I I 1 I I 
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MCSSVAAKLW FLTDRRJRED YPQKEILRAL KAKCCEEELD FRAWMDEVV LTIEQGNLGL 60 
SSsUW PQVVWRVPT PWVQSDSDIT VLRHLEKMGC 

ELAGHGVPLP DTFSYGGHEN FAKM1DEAEV LEFPMVVKNT ™™GWTL *™™^}< 180 
SHLIRHEAPY LFQKYVKESH GRDVRVIVVG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240 
LSEQGKQLAI QVSNILGMDV CGIDLLMKDD GSFCVCEANA NVGFIAFDKA C^VAGIIA 300 
DYAASLLPSG RLTRRMSLLS VVSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTKLPGGL FNMNQLLANE IKLLVD 

Seq ID NO: 73 DNA sequence 

Nucleic Acid Accession*: XM 040080.2 

Coding sequence: 159-1 104 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CTGAGTGGGG GCGGGGACTO CTGGAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60 
CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCCTGCGAC CTAGCCAGGC 120 
GTGAGGGAGT GACAGCAGCG CATTCGCGGG ACGAGAGCGATGAGTGAGAA CGCCGCACCA 180 
GGOTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 
GGCTCCCTGC AGGGCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGGACAA TGCCAGCTCC 300 
TTCGACAGAC TCATCCCTCT TCTCCCGCAA GACTTTTATT ACGTTGCCAT GGATTTCX3GA 360 

Stcatgggctctcgtcccattacagcc^ 

AGTG AGATCC GAAGAGTTGT GGCAGCCTTG AAATGG AATC GATTCTCCAT TCTGGGCCAC 480 
AGCTTO^TG G^CGTCGT^G CGGAATGTTT TTCTGTACXrr TCCCCGAGAT ^^9^1^^ «? 
CTTATCTTGC TGGACACGCC GCTCTTTCTC CTGGAATCAG ATGAAATGGA GAACTTGCTG 600 
ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAG GTAG AGG CCTCCCA GGAGCCCTCG 660 
CaStGTTCA SaAGCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACTTCAGT 720 
GAGG^^^GWjGGGAGCTTCT CCTGCAAAGA GG AACCACGA AGGTGGCCAC AGG^GGTT 780 
CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 

Sgtct^gcatocatcaggaagct^ ™ 

GGATATTTTG ATTCAAGACA GAATTACTCT GAGAAGGAGT CCCTGTCGTT CATGATAGAC 960 
ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
GTCCACATGA GCG AACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
Sc^GCTCCCAGCC^^^^ GCCTGG AACT ATGAAGACCT AGTGCTCCCA 1140 

G^CTCA^G^^ TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGG AC 1200 
AGGCCTCACT AGTCTTGAGG CCCAGCCTAG G ATGGTAGTC AGGGGAAGGA GCGAGATTTCC 1260 
AAOTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGG^AG^GCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA l AAA AGG i iC i iuiAi tCTC A 

Seq ID NO: 74 Protein sequence: 
Protein Accession XP_040080.1 

I 11 21 31 41 51 

MSENAAPGLI SELKLAVPWGHIAAKAWGSL QGPPVLCLHG WLDNASSFDR LffLLPQDFY 60 
^^^OlSnSoVP YYLQTFVSEI RRWAALKWN RFSILGHSFG GVVGGMFFCT 120 
rSis DEMENLLTYK RRAIEHVLQV EASQEPSHVF SLKQLLQRLL 180 
SsHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDFISRELCA HSIRKLQAHV 240 
SSgYF D SROTKE SLSFMIDTMK StLkEQFQFV EVPGNHCVHM SEPQHVASII 300 
SSFLQCTHML PAQL 

Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession*: NM 005794 

Coding sequence: 434-1276 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATG ATAAAT CTTGCTGCTG 60 

CTTCACTCCT GGGGCCAGCA AGACCACG AA TGCACCGAGA \^ 
P ArCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240 
Grr^CCAA G^ CATTTTGGTG ACTTTG AAGA GACTGTCACC 300 

?I?CAC^A^TG^TG7^ GTCAGACCAT 360 

CA^GCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 
gISo^AACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 
?^A^G^™^GGATGAGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540 
CCGGGTAGCC GTG^AOTG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
SScCGGGAC GGGGC^ACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660 
rS^CAAG CTGCAGGGGG AGGGGCTG AG TGTGGCGGGC ATTGTGTGCC ACGTGGGG AA 720 
GgSgAG^AcS^ GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 

CC^GGTGTCC^ AGCG CAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 

gSggacaa^aS 

GCTGCCCTAC ATGG AGAACA GG AGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 
^A^CTG GCATOGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TC ATGGGAAT GAGTCTCTCT GGAAGAACTT 1140 
CAAGGAACATCATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200 
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CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260 
CTCCACTCGG CTCTGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 
CC 

Seq ID NO: 76 Protein sequence: 
Protein Accession #: NP_005785 

I li 21 31 41 51 
I I I 1 I I 

MLSAVARGYQ GWFHPCARLS VRMSSTGfDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60 
AHVVISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AG VNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRG A VILV SSIAAYNPW 1 80 
ALGVYNVSKT ALLGLTRTLA LELAPKD1RV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAGYSTRL 



SeqIDNO:77 DMA sequence 

Nucleic Acid Accession NMJ>02425 

Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGG A 240 
GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 
TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 
AGGAGAGGCT G ATATAATG A TCTCTTTCGC AGTTAAAGAA CATGG AGACT TTTACTCTTT 540 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 
TATTCACTTT GATGATG ATG AAAAATGGAC AG AAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAAC CCCT 840 
GGTGCCCACA AAATCTGTTC CTTCGGGATC TG AGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTG ATTTCTG CAT TTTGG CC 1 020 
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGG A AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1 140 
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
CAAGGAAAAG A AG AAA AC AT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGG AGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 
GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 
GTTACATTGC TAG GCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1 500 
ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAA T AGGTGAGAGA 1680 
ATGTATTTTC ATAG ATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1 740 
CTT _ . - . . 

Seq ID NO: 78 Protein sequence: 
Protein Accession #: NP JW24 1 6 

] II 21 31 41 51 

MHLAFLVLLC LPVCSA YPLS G AAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 
K1QGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPK.WRKT HLTYR1VNYT \ 20 
PDLPRDAVDS AJEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 1 80 
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 
FTELAQFRLS QDDVNG IQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 
RGEYLFFK.DR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 
IRGNEVQAGY PRGIHTLGFP PTIRX1DAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC 

Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession »: NM_000493.I 

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGCTGCCAC AAATACCCTT TTTGCTGCTA GTATCCTTGA ACTTGGTTCA TGGAGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120 
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CAGTTCTTCA TTCCCTACAC CATAAAGAGT AA AGGTATAG CAGTAAGAGG AG AGCAAGGT 1 80 
ACTCCTGGTC C ACCAGGCCC TGCTGG ACCT CG AGGGCACC CAGGTCCTTC TGG ACCACCA 240 
GGAAAACCAG GCTACGGAAG TCCTGGACTC CAAGGAGAGC CAGGGTTGCC AGGACCACCG 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AG GAG AG AG A 360 
5 GGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCCCTGGTA TGAATGGACA G AAAGGGGAA ATGGG ATATG GTGCTCCTGG TCGTCCAGGT 600 
GAGAGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGGAGTGGG A 660 

1 0 AAAAGAGGTG AAAATGGGGT TCCAGGACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720 
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
G AAGGCATTG GAAAGCCAGG AGCTGCTGGA GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGG AG AAAG A GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 

1 5 GCCAAAGGGG AACAAGGGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCCCT 1020 
GGG AATATGG G ACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TGAAAGGGGT 1 140 
TCCCCTGGGT CAGATGG A AA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1200 
GGTAACCCAG GGTTACCAGG TCCAAAAGGT GATCCTGGAG TTGGAGGACC TCCTGGTCTC 1260 

20 CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGCCCTA TTGGGCCACC AGGCATTCCA 1380 
GGATTCCCTG GGTCTAAAGG GGATCCAGGA AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACCCACCGGG CCACCAGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1S60 

25 ATGCCTGAGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGAC CCCTCTTGTT 1620 
AGTGCCAACC AGGGGGTAAC AGGAATGCCT GTGTCTGCTT TTACTGTTAT TCTCTCCAAA 1680 
GCTTACCCAG CAATAGGAAC TCCCATACCA TTTGATAAAA TTTTGTATAA CAGGCAACAG 1740 
CATTATGACC CAAGGACTGG AATCTTTACT TGTCAGATAC CAGGAATATA CTATTTTTCA 1800 
TACCACGTGC ATGTGAAAGG GACTCATGTT TGGGTAGGCC TGTATAAGAA TGGCACCCCT 1860 

3 0 GTAATGTACA CCTATGATG A ATACACCAA A GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
ATCATCG ATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980 
GGCCTATACT CCTCTGAGTA TGTCCACTCC TCTTTCTCAG GATTCCTAGT GGCTCCAATG 2040 
TGAGTACACC CCACAGAGCT AATCTAAATC TTGTGCTAGA AAAAGCATTC TCTAACTCTA 2100 
CCCCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ATGTAATTTT TATTTTCTGA 2160 

3 5 AATACAG ATT TG AGCTATCA GACCAACAAA CCTTCCCCCT GAAAAGTG AG CAGCAACGTA 2220 
AAAACGTATG TG AAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGGTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAAAG A AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA A AAAAAC AG A AATAGAGCTC TAAGTTATGT 2400 
G AAATTTGAT TTG AG AAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2460 

40 TATGAGAACT TCTAGG AAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TGAGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTAAAGCTC TTATATAA AA AAGCCCCAA A ATATTGAAGT TCATCTGAAA TGCAAGGTGC 2700 
TTTCATCAAT G AACCTTTTC AAAACTTTTC TATGATTGCA GAGAAGCTTT TTATATACCC 2760 

45 AGCATAACTT GGAAACAGGT ATCTG ACCTA TTCTTATTTA GTTAACACAA GTGTGATTAA 2820 
TTTG ATTTCT TTAATTCCTT ATTGAATCTT ATGTGATATG ATTTTCTGGA TTTACAGAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTG AAGTT ATAATTTACA CTG AGGGTTT 2940 
CAAAATTCGA CTAGAAGTGG AG ATATATTA TTTATTTATG CACTGTACTG TATTTTTATA 3000 
TTGCTGTTTA AAACTTTTAA GCTGTGCCTC ACTTATTAAA GCACAAAATG TTTTACCTAC 3060 

50 TCCTTATTTA CGACACAATA AAATAACATC AATAGATTTT TAGGCTG AAT TAATTTGAAA 3120 
GCAGCAATTT GCTGTTCTCA ACCATTCTTT CAAGGCTTTT CATTCGACAC AATAAAATAA 3160 
CATCAATAGA TTTTTAGG 

Scq ID NO: 80 Protein sequence: 
55 Protein Accession*: NPJW0484.1 

1 11 21 31 41 51 

60 MLPQIPFLll VSLNLVHGVF YAERYQTPTG IKGPLPNTKT QFFIPYTIKS KG1AVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA G1SVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 
' VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 
GEMGP1GPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

65 PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG 1PGSHGLPGP 360 
KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPK.G DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGCPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFKAGQ RPSLSGTPLV 540 
SANOG VTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQ1PGIYYFS 600 

70 YHVHVKGTHV W VGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 81 DNA sequence 
Nucleic Acid Accession #: NM.00Q786 
75 Coding sequence: 332-1861 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
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CGCGATTCTC AGGGATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCCGGCC CCGTACCACT 60 
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAG AAGA CCACGGCTTC 1 20 
ACAGAGTGTT ATTTAAGGGC GTGGCCAGCG G AACATCCCG CCCCATTCTG TGACGCACGG 1 80 
GGTGGCGCGC GTGGGACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CGACCATCTG CCAGCTTCTC TCGTTCCGTC GATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATGGCGGCG GCGGCTGGGA TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420 
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGG TCTACCTG AT 480 
CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAGTC CTCCATACAT 540 
TTTCTOCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGG AAAAGTC CAATTGAATT 600 
TCTAG AAAAT GCATATGAGA AGTATGGACC TGTATTTAGT TTTACCATGG TAGGCAAGAC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TGCACTGCTT TTTAATAGTA AAAATGAAGA 720 
CCTGAATGCA GAAGATGTCT ACAGTCGCCT GACAACACCT GTGTTTGGGA AGGGAGTTGC 780 
ATACGATGTG CCTAATCCAG TTTTCTTGGA GCAGAAGAAA ATGTTAAAAA GTGG CCTTA A 840 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACT TTGA 900 
GAGTTGGGGA GAAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAG GTTG 1080 
GCTGCCTTTG CCTAGTTTCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGGATATTTT 1 140 
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATGACA TTCTCCAAAC 1200 
TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAGAAC AGAAAACAGT 1380 
CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
TCGCTGTATA AAAG AAACAT TA AGACTTAG ACCTCCTATA ATGATCATG A TGAGAATGGC 1 500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GGACATCAGG TGTGTGTTTC 1560 
TOCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA CGCCTGGACT TTAATCCTGA 1620 
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1680 
TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGCCTATGTT CAA ATT AAGA CAATTTGGTC 1 740 
CACTATGCTT CGTTTATATG AATTTGATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1 800 
TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACGAA GATCAAAAJG I860 
A.AAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
GAGAATGAAG TGTACAAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTAATTCTA 1980 
AAAGCCAGTT TATGATTTAG GATTTTGTTA ACTGAATGGT TCTATCA AAT ATAATAGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2100 
AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAGAGTACTC ATCGCTCTTG CCAAAT AAGT TCAGGGTATT CAAATCTTGG 2220 
ACTAGTCCTG CAAGGTATAA AGAATAAAAA TCCCAGTGAG ATACTTGGAA ACCACAGTTT 2280 
ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGG ATG GAAGGGTAGG GAATAATCG A 2340 
ACATCTAAAG CCTTGAATAA GAGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGGAAAAAGA AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAGAG AT CCAGGACCAT ATTTG ATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
TAAATATGTA TCTAG TTCTA CAAGGTCACA TTTATGTGGA AGTCCAAA GT CAAGTCCT TA 2640 
GGGGATAATT TTGTTTTGGG CTCAGTTGTT CCCTGCTTCC 1111111111 IT 1 1111111 2700 
TTGAGATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGCCTCCCAA GTAGTTGGGA 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA Till 1TGTAT TTTTAGTAGA GACGGGGGTT 2S80 
TCACTATGTT GGCTAGGCTG GTCTTG AACT CCTGAGCCTC GTGAGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 
TTTTCTTTTT AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA AT AAAATTGG TC AC ATTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3180 
TCTCAAATTG TAACTCGGGG CCTAACTATA TGAGATGGCT G A AAA AAT AC CACATCGTCT 3240 
GTTTTCACTA GGTGATGCCA AAATATTTTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAATTGT GAAGGCAGAA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTGAAAAAA T 

Seq ID NO: 82 protein sequence: 
Protein Accession U: NP 000777 

1 11 21 31 41 51 
I I I I I I 

MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMLLIAC AFTLSLVYLI RLAAGHLVQL 60 
PAG VKSPPYI FSPIPFLGHA IAFGKSPIEF LENA YEKYGP VFSFTMVGKT FTYLLGSDAA 1 20 
ALLFNSKNED LNAEDVYSRL TTPVFGKGVA YDVPNPVFLE QKKMLKSGLN IAHFKQHVS1 1 80 
IEKETKEYFE SWGESGEKNV FEALSELIIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHREIKDIF YKAIQKRRQS QEKIDDILQT LLDATYKDGR 300 
PLTDDEVAGM LIGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCDCETLRLR PPIMIMMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFKPD RYLQDNPASG EKFAYVPFGA GRHRCIGENF AYVQIKTIWS TMLRLYEFDL 480 
IDGYFPTVNY TTMIHTPENP VIRYKRRSK 

Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession ft: NM_00655 1 .2 

Coding sequence: 64-336 (underlined sequences correspond to start and stop codons) 
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1 II 1\ *l " I 

AATTCTAGAA GTCCAAATCA CTCATTGTTT GTGAAAGCTG AGCTCACAGC AAAACAAGCC 60 
ACC^TO/^GCTOTCGGTGTG TCTCCTGCTG ^^^^^^^^^^^^^J^i^?PP .!? 
AATGCCGAGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA 180 
CCTCrcriX^AGT^\^TCT TGCCAAATTT GATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
^^^a^^ca^^aomg TCCCTTCAGA AACGAAGOT CATO^ 300 
GTCCTGGTGA AAATATTGAA GAAATGTAGT GTGTCACATG TAAAAACTTT CATCCTGGTT 360 
TCCACTGTCT TTCAATGACA CCCTGATCTT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Scq ID NO: 84 Protein sequence: 
protein Accession U: NP„006542.1 

1 II 21 31 41 51 

MKLSVCLLLV TLALCCYQAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQK.RSLIAEV LVK1LKKCSV 

Scq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: NM 002317.I 

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codons) 



25 1 n 21 31 41 51 
I I ! I 



GGGOCAGGAC TGAGAAAGGG GAAAGGGAAG GGTGCCACGT CCGAGC AGCC GCC^GACTG 60 
GGGAAGGGTC TGAATCCCAC CCTTG GCATT GCTTGGTGGA G ACTGAGATA CCCGTGCTCC 20 
GCTCGCCTCcVrGGTTGAAG ATTTCTCCrr CCCTCACGTG ATTTGAGCCC CGT^ATT 180 
30 TOTCTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG A^CGCTTCG 240 
SgGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
cScCGCCG^ CC^CAGCA^CCCGCGCG AGCCGCCGGC GGCTCCGGGC GCCTGGCGCC 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
apt AGCCTCA GCGCCGCCGG G ACCCGGGCG CCGCCGT<XC TGGTGCAGCC AACGCCTCCG 480 
15 CC^AGCAGCC CCGCACTCCG ATCCTGCTG A TCCGCGACAA CCGCACCGCC GCGOGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
^nr^r^Qn^KCTCa ACATCTAGAG CCCGCGAAGC TGGGCCCTCG CGCGCGGAGA 660 
ACCAGACAGC GCCGGGAGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 

aStcg?gggcga^ 

40 ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GC^SgSa CTOCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 
rGTCCACGTA CGTCCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GA^^C^COCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 

45 a^gg^ggS 

Vctacctgcttgatgccaacacccagaggagatgggctgaaggccacaaa 

fiTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 

2ISIgga^gagtS 

^GATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA GGTCAGTGTA AACCCCAGCT 1380 
50 TcctSctGaIt^ ATGTTOTGCG CTGTGACATT OGCTACACAG 1440 

GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TTAGAAGGCA AAGCAAAACT 500 
rrCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAATAGACT AACTTCAGTA 1560 
GG ATTT ATGT AT^THXiAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG T^GGACTCT 1620 
mCAATAAC AAAGCACATA ACTGGATTTT GAACGCTTAA GTCATCATO I«0 
5 5 TTTTAATGTT TATTATTTAC ATCACTTTGT GAATTAACAC AGTGTTTCAA TTCTGTAATT 1740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 



Seq ID NO: 86 Protein sequence: 
Protein Accession #: NPJ)02308.1 



1 11 21 31 41 51 

LrFAWTVLLL GPLOLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
G!toYQPQRRR DPGAAVPG AA NASAQQPRTP tLURDNRTA AGRTRTAGSS GVTAGRPRPT 120 

KjPYVNYYDTY ER^FWRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
^^^aVr^dv^ydmr VLLRFPQRVK NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 300 
fKlDAN ^RR^AEGHK ASFCLEDTSC DyWrFAC TAHTQGLSPG CYDTYGADID 360 
CQwSlDVK PGNY^KVSV NPSYLVPESD YTNNVVRCDI RYTGHHAYAS GCTISPY 



Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession*: NMJW6419.1 

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons) 



75 j 11 21 31 41 51 

WcGGCACTT GGGAG AAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60 
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GCTCAAGTCT GAACTCTACC TCCAG ACAGA ATGAAGTTCA TCTCGACATC TCTGCTTCTC 120 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 1 80 
TTGAGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAGACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGG GAATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAGAAG 300 
AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
TTGAGAAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAGAG AAA GATTCCCTGA 420 
TGCTGATATT TCCACTAAGA ACACCTGCAT TCTTCCCTTA TCCCTGCTCT GGATTTTAGT 480 
TTTGTGCTTA GTTAAATCTT TTCCAGGG AG AAAGAACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGGAGC TGATGGGGCA AACTCAAGCT TCTTCACTCA 600 
CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660 
AAAATAGTTA TTCAGTT ATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720 
TGTTTAAAAT TTCTTAGAAA ACAATGGAAT G AGAATTTAA GCCTCAAATT TGAACATGTG 780 
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840 
AAAGCTGCCT GGGAGGCAGA TGGAACTTG A GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 
TAG ATATCCT CTGCTTAAAA ACTCACTACG GAGGAGAATT AAGTCCTACT TTTAAAGAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA T<XTTTCACA CATTTGCCTT 1080 
GACAAACTTC TTTCACTCAC ATCTTTTTCA CTGACTTTTT TTGTGGGGGC GGGGCCGGGG 1 140 
GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 
AGCAAACATT TTACTT 

Seq ID NO: 88 Protein sequence: 
Protein Accession #: NPJJ064 10. 1 

1 II 21 31 41 51 

MKFISTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRFIDR IQILPRGNGC 60 
PRKEIIVWKK NKSIVCVDPQ AEW1QRMMEV LRKRSSSTLP VPVFKRKIP 

Seq ID NO: 89 DNA sequence 

Nucleic Acid Accession #: NM 002652 

Coding sequence: 37-477 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CTTCTCTGGG ACACATTGCC TTCTGTTTTC TCCAG CATGC GCTTGCTCCA GCTCCTGTTC 60 
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 1 20 
CAGGACAACA CTCGGAAG AT CATAATA AAG AATTTTG ACA TTCCCAAGTC AGTACGTCCA 1 80 
AATGACGAAG TCACTGCAGT GCTTGCAGTT CAAACAGAAT TGAAAGAATG CATGGTGGTT 240 
AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCT ATGTG ACG ACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAGAACTGTG 360 
CAAATTGCAG CCGTCGTTG A TGTTATTCGO GAATTAGGCA TCTGCCCTGA TG ATGCTGCT 420 
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTG AAA TCCTAAAGGT AGAAJAATGG 480 
AAGCCCTGTC TGTTTGCCAC ACCCAGGTGA TTTCCTCTAA AGAAACTTGG CTGGAATTTC 540 
TGCTGTGGTC TATAAAATAA ACTTCTTAAC ATGCTT 

Seq ID NO: 90 Protein sequence: 
Protein Accession #: NPJ02643. 1 

1 II 21 31 41 51 

MRLLQLLFRA SPATLLLVLC LQLGANKAQD NTRKIIIKNF DIPKSVRPND EVTAVLAVQT 60 
ELKECMVVKT YLISSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAWDVTREL 120 
GICPDDAAVI PIKNNRFYTl EIUCV 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession U: AK00034 1 

Coding sequence: 85-975 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GATAGCGOCG GGCAGAGGGA CCCGGCTACC CTGGACAGCG CATCGCCGCC CGCCCGGGTC 60 
GCCGCGCCAC AGCCGCTGCG GATCATGGAA CATCTAAAGG CCTTTGATGA TG AAATCAAT 120 
GCTTTTTTGG ACAATATGTT TGGACCGCGA GATTCTCGAG TCAGAGGGTG GTTCACGTTG 180 
GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 
AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CTTGGAATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GGAGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTG A CATC CGGG TA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 
TTCGTTTTGC GGAAAAAA AC G AGTCAG ATT ACTTTTCTTC ATGTATATCA TCA TGCT TCT 540 
ATGTTTAACA TCTGGTGGTG TGTCTTGAAC TGG ATACCTT GTGG ACAAAG TTTCTTTGGA 600 
CCAACACTGA ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGG ACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGG AAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAGC GCCGTCGTGA AACCGTGTGG CTTCCCCTTC 780 
GGTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 
TATGTTCAGA CATACCGAAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAGAAGTGA AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTG ATGAAC 960 
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AAGAAAOCACAAI^^ 

TTGTTTTAAA GCAAAGACTQ AATTGAAGGT TACATGTTTT AGGAJA AACT AAmCHTTT 080 
GAGTTC ATAA ATCATTTGTA CCCAG AATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1 HO 

AACTGAATGC TTTGATCAGC ATTGAGGTGA ^^^S^SS^SjP/^Pf^^Qj^rr^i^r^\2^ 
5 CAGCTTCTCT CTCCCTCCCT CCCACAGACT GAACCTTTCG CCAG^GCTG TCOTA^C12«I 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT yT^^CAOJ^A^W^OTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TT ACAACAAA ATCTTAATTA TATTCTAAAT 1380 
ACAGfiCTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1« 
C^S«A TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAGAAAAAT 1500 

10 CT^GCAAVW^ TTCTCTTTCT GTAAGCCAG A GTCTTGTCTA TCAGATTCCC ACAACCACTC 1560 
CTCATP^^^ATTIAGTC AT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGT^TT 1620 
CTAAGGAGAA^AAAATGCTT CTGCAAGATT ^^^AATTC^^GGGCTGTGGATAGGATTG 1^680 
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAOCCTAG 1740 
^AT^AGA TCCC^GCTCCACACCACXSA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 

1 S TTCTGAGCAA GCACTGGAGT CCACTCCGAC CTTTTTCTTT G AACATGCAT GCTGCTGGAA 1 860 

tIStataaa tcagaactag caoaagtagc agagtgatgg oagcaaaata ggcactgaat 1920 

TCGTCAACTC TT^^GTGAGCCTACTTGT G AATATTACC TCAG ATACCT GTTGTCACTC ^0 
™Sa^tWgaAGCTGGGAGG^ 

TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAAC1 Z1W 
00 TA^rArrcCT GTTCCCTGCT CAGGTATATG CGATCTCTACTGAGAAGCAA GCACAAAAGT 2160 

20 A^ACCA^GTA^S 

AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTOA ^AGATTAATTT 2280 
ATACCCTATA AAATAAAACT TGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

25 Seq ID NO: 92 Protein sequence: 

Protein Accession #: BAA9 1096. 1 

1 U 21 31 41 51 

30 MEHLKAFDDE ^AFLdUg PRDSRVRGW ^ « 
at ci pnn Ti YNI GITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEADI RVAK.VLWWYY uu 
^V^T^F^ OITFLHVYHH ASMFNIWWCV LNWIPCGQSF FGPTLNSFVH 180 

^LT^^^^MQ^? AGKEVKNGFS KAYFTAANGV MNKKAQ 



Seq ID NO: 93 DNA sequence 

Kucieic Acid Accession NM_000044 ^ ^ ne i 

Coding sequence: 1 1 15-3874 (underlined sequences correspond to start and stop codons) 



40 1 11 21 31 41 51 

PflAfiATCCCG GGG AGCCAGC TTGCTGGGAG AGCGGGACGG TCCGGAGCAA GCCCACAGGC 60 

AAGGG ACGCA CCACGCCAGC CCCAGCCCGG CTC^GCGAC 
AS r-rrrr.rtrrrr a a AGCCGCCG CCCGGAGCTG CCCTTTCCTC TTCGGTG AAG TTTTTAAAAU iw 

CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC f ^^^^^SaCCC 420 

ccgcagctgc ctcagtcggc tactctcagc caacccccct^ c ^^^^^^ r A2 ^ Q 

nrcrrccCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 

50 togctgc^agS *° 

^ C^GGGAGC»GOTCAGCACTGCAGCCACGACX^^ 
ACCCTCTGTTTTCCCCCACTC^ 

AGCAG AGATC AAAAGATGAA AAGGCAGTCA GGTCTTCAGT ^CCAAAAAA ^^AAC 720 
AAAAAC AAAA AAGCCGAAAT AAAAGAAAAA GATAAT^AACTCAGITC^ 
55 TTCAGTGGAC ACTGAATTTG GAAGGTGGAG G ATTTTGTTT J^CTmA n^SI^C 900 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG ^^^^^^JS^f^F^^^^^^^^ 
TTGTCCACCG TGTGTCTTCT TCTGCACG AG ACTTTGAGGC TGTCAG AGCG CrnTTCCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCO:GCAG [^^^°^^°^^ 

aptarrgcat catcacagcc tgttgaactc ttctgagcaa gagaagggga ggcggggtaa 108u 
60 gSaaStcg^caS 

CGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT ^AGAATC JSSSkMC U6» 
CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCAGA(MCCGCGAG(X3CAGCA^ 1260 
TScKCGCCAGTrrGCTGCTGCTGCAGCAGCAGCAGC^ 

GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA ^ACTAGCCCC AGGC^CAGC^GC/^C^CA |380 
65 GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCTCC AC^GCTAttTGGTCCTGGA 1440 

7A S^rSr" CAGCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA GCGGGAGAGC 1740 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA G^ 0 ^ 0 * G ^^^^^ 

/£ VrrrnTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
?^Son^ GCCCCATTGG CCGAATGCAA 1980 

75 aSctg*^ 

1 QH 
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AGCACTGGAC GAGGCAGCTG CGTACCAGAG TCGCGACTAC TACAACTTTC CACTGGCTCT 2220 
GGCCGGACCG CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGGACCT 2340 
GGCGAGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
TTCCTCATCC TGGCACACTC TCTTCACAGC CGAAGAAGGC CAGTTGTATG GACCGTGTGG 2460 
TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CX3GCGGCGGC GGCGGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTACGGCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG G AAAGCGACT TCACCGCACC TGATGTGTGG TACCCTGGCG GCATGGTGAG 2640 
CAGAGTGCCC TATCCCAGTC CCACTTGTGT CAAAAGCGAA ATGGGCCCCT GGATGGATAG 2700 
CTACTCCGGA CCTTACGGGG ACATGCGTTT GGAGACTGCC AGGGACCATG TTTTGCCCAT 2760 
TGACTATTAC TTTCCACCCC AGAAGACCTG CCTGATCTGT GGAGATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAAGAG CCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAGAAA TGATTGCACT ATTGATAAAT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCGTC TTCGGAAATG TTATGAAGCA GGGATGACTC TGGGAGCCCG 3000 
GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT G AGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3 120 
TCAGCCCATC TTTCTG AATG TCCTGGAAGC CATTGAGCCA GGTGTAGTGT GTGCTGG ACA 3 1 80 
CGACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
GAGACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 
CXJTGGACGAC CAGATGGCTG TCATTCAGTA CTCCTGG ATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAGGATGCTC TACTTCGCCC CTGATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCG GATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCTCTCAA GAGTTTGGAT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 
ACTTCGAATG AACTACATCA AGGAACTCGA TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATCCTGC TCAAGACGCT TCTACCAGCT CACCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGCGAGAGAG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TGGTGAGCGT 3780 
GGACTTTCCG GAAATGATGG CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGG 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GGAAACCCTA TTTCCCCACC 3900 
CCAGCTCATG CCCCCTTTCA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGG GGAATTTCCT CTATTGATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TATTTGCTGG GCTTTTTTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 
AACCXTTCCCA TGGCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTTGAA 4140 
TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 
TGTTTACAGC ACTACTCTGT GCCAGCCACA CAAACGTTTA CTTATCTTAT GCCACGGGAA 4260 
GTTTAGAGAG CT A AG ATT AT CTGGGGAAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 
A 

Seq ID NO: 94 Protein sequence: 
Protein Accession #: NP_00OO35. 1 

1 11 21 31 41 51 
I I 1 I I I 

MEVQLGLGRV YPRPPSKTYR GA5QNLFQSV REVIQNPGPR HPEAASAAPP GASLLLLQQQ 60 
QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTG YLVLDE EQQPSQPQSA 1 20 
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK 1 80 
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240 
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LG VPPAVRPT PCAPLAECKG SIXDDSAGKS 300 
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360 
DYYNFPLALA GPPPPPPPPH PHARHCLENP LDYGSAWAAA AAOCRYGDLA SLHGAGAAGP 420 
GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480 
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540 
TARDHVLPID YYFPPQKTCL ICGDEASGCH YGALTCGSCK VFFKRAAEGK QKYLCASRND 600 
CTIDKFRRKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PTEETTQKXT 660 
VSH1EGYECQ PIFLNVLEA1 EPGVVCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720 
ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSIIPVDG LKNQKFFDEL RMNYDCELDR 840 
IIACKRKNPT SCSRRFYQLT K.LLDSVQPLA RELHQFTFDL LIKSHMVSVD FPEMMAEIIS 900 
VQVPKILSGK VKPIYFHTQ 

SeqIDNO:95 DNA sequence 

Nucleic Acid Accession tf: NM_002497 

Coding sequence: 135-1472 (underlined sequences correspond to start and stop codons) 

I 1 1 21 31 41 51 
I I I I I I 

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 120 
GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGG ACTATGA AGTGTTGTAC ACCATTGGCA 1 80 
CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 
GG AAAGAACT TGACTATGGC TCCATGACAG AAGCTG AGAA ACAGATGCTT GTTTCTG AAG 300 
TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATGAT CGGATTATTG 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 
TGATGACTCA GTTGACTCTG GCCCTGAAGG A ATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGGAGA CTTTGGGCTA GCTAG AATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 



388 
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TTfiTTG G C AC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720 
^f^^^^f^T^TTATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 

rrCAOQ ATC CAGCCCTGTA TTCAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC .080 

gSISa^tgVgtcagctcacat^ 

fifiCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA^TTACCJW^ l w 
rr AAAAGCAG A^GATCCTG GGCATGCGCT AGCCAGGTAO AGAGACACAG AGCTGTGTAC 1500 
1 5 AGGATGTAAT ATTACGAj^C TTTAAAGACTO^ATTCA AA TGCTGTAGTG *TTGAATACTT ISM 
GGCCCCATGA gSaTGCCTT TCTGTATAGT ACACATGATA TTTCGGAATTGGTTTTACTG 1620 

rlS^GA TGTCACATC T^T^ GA^AAAAAT AATATTAGGA A AAA AAT ATT 1 800 

r^rrcTG tttaacagaa tatgagctgt ctgtcattta cctacttctt TCCCACTAAA 2100 
25 taaaagaatt cttcagtta 

Seq ID NO: 96 Pmtein sequence: 
Protein Accession #: NP_002488 

30 1 " 21 31 41 51 

niREFNPNIM ANEVERINMI TOSISGENEA RELATEDKIN ASEHOMOSAP IENSMPSRAE 60 
n^V^I^ KELDYGSMTE AEKQMLVSEV NLLRELKHPN 1 20 

1 <: ruVR<5nfiGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240 
n^M^K SD^LGCLL YELCALMPPF TAFSQKELAG KJREGKFRRI PYRYSDELNE 300 

if i F IOI OER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LI^kiu-l^ 420 
40 LSDIEKNYQL KSRQILGMR 



Sea ID NO: 97 DN A sequence 

Nucleic Acid Accession #: NM 007050.2 

^sequence: 1 85-4576 (underlined sequences correspond to sUrt and stop codons) 



GGT ^^r ^SSctA aS^CGAGGG CCGAATGGAG AGCGAGGAGCTGGTGGTGCA 1620 



PCT/US02/02242 
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GGGG AAAGTG TTCAAGCTCC GGAATGAAAC CCACC ACCTC TTTGTGGGTC TGTACCCAGG 1 860 
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT GAGTACGACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT GATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2100 
AGCTGACATT ATTG AGTGCT TTTCGGTGCC CGTG AGCTAT CGGAATGCCT CCAGCCTCGA 2 160 
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAG A CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCCCTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CAAAGCCAAT GGAGAGACCA AAATCAACTG 2340 
TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GGACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAGAAGCA 2460 
GGTGGACAAC ACCGTG AAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 
GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
TGAAGGCTTC TCTTCTAGTT CTCAGG ACGT CAACGGATTC ACAGATGGCA GCCGCGGGGA 2760 
GCTTTCCCAG CCCACCCTCA CGATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
GATGAGCTAC CCCCGGGACC AGTTCCAACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 
AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGGAGGAT GAAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAGA 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTGACGGA TACCATCGAC CTCGGCACTA 3120 
CATTGCG ACT CAAGGTCCG A TGCAGG AG AC TGTAAAGG AC TTTTGG AG A A TGATCTGGCA 3 1 80 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG GAAGTGGGCA GGGTGAAATG 3240 
TGTGCGATAC TGGCCAGATG ACACGGAGGT CTACGGAGAC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TGAGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 
TGACACCATG CTTGACATGG CCGAGAATGA AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGG A CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 
GCTCCTGCCC CGGAACCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGGACGGAGA ATCCAGCAAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTGAATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 
GCCCATCCAG GTGGAGTTCG TCTCCGCAGA CATCGACGAG GACATCATCC ACAQAATATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTT ATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTGGCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 
G ATCCAGCAG CAAAACATCA TTG ACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATCCAACATG GTGGAGACCC TGGAACAGTA TAAATTTGTA TACGAGGTGG CACTGGAATA 4560 
TTTAAGCTCC TTTTAGCTCA ATGGGATGGG G AACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCCCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 
GGGGTCTCCT GTTGCCGTGG GAGATGCTGC TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGG ATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAGGAGTGG AGGAGATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TG AGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 
CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 5100 
CCTTACGATG TTCCCTAGGC CACAAGAGAG GCCCCCCATC CTCACACCTA ACCTGCATGG 5160 
GGCTTCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTGACC GTCCAGCATT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTG ATAAA GGTGTTCTTT GCTTTTTTGT 5280 
GTGGTCAGTG GGAGGGGGTG GAACTGCAGG G AACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 
AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCCTGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAACAAGATA AGATTTGATT ATTTTCCAAA 5460 
GTGTATGTGA AAAG AAACTT TCTTTTGGAG GGTGTAAAAT CTTAGTCTCT TAT GTCAA AA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA CCCTGGGACA GCATG CCACT G AGCAAG G AG 5640 
AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTGAACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
AAAAGGCCAT TGTGGAGCCC ATTATACTTT G ATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GG ATCATGCT TGGGACTTTC CATAGGACAT TTCACACAGT 5940 
CCTTAGAACC CCCAGGAGAG AGCTACTGAC TTGTTATCAT CTCCATTTG A TCATCTCCTC 6000 
CAATGAGGAA ACCCACGCAC CTTCCTTAGT AATG AAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAAT GGA AATTTAG 6180 
TACTTTC TTC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 
CACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300 
TCCCTGTTCC TTCAGGCTAT G AATGG ACCA GCCAGGG AAA CTCTCGACCT TGATCTCTAG 6360 
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 

390 
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•rrr a Arrrrn fiTCT AGCCCC AGACAGGGCA TGAGGGGCAG AGAATTCAAG AAGGTACAGC 6480 

" GGAMTGGAACCATGGTGCA OTAQTOTGGA 6540 . 
GTGGATG AGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTC TTCGGCCTCG i GGCTTCCTGGl S600 
ipurfTfiAP fnrTATTrCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGGAAACT 6660 
S PArrCAAAGG ^tc£SS TGC^TGGCA ^AGAAATGTG AAAAGGCATT GGGAGAAGCA 6720 
5 Sg^GAGgS^GTGC^GGCAC AGCACAACACGGGG^C^GAOATGAG «80 
X-ArnrvrrrA GGAGACAGAT GCCCATCGCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGGA GC^^AX^G^G^TCCACTCA GCTTCGTGTT G ^^^P^^^J^^^^qF 6960^ 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA 7020 
1 0 r^iTTAATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 
10 GCAATC(M^^A'rcAG^TCA^^TCTATG A GAAACCCAGG 9T,9^^P^^'^^^^q^^^J^j4q 
PAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATGAAAGG AGCCGACAAA ,/MU 
AG^GCTHX A^TCGCATGG GCTCTGGAGCTGTCCAGAAG TCCAGGGACA CCAGACTTGA 7200 
tSgGAAGG i GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA .7260 
1 S I SI£ a ACCCC ACGATG AACT TGCTCTTTTC CTTTGATGAG CCTCTCCCCA GGTGTATTTC 7320 

GC^C^GOT GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATCCTATnT CTTAG ATCTT GGTACATCCA GACAGGATGA GGGTGGAGGG AGAGCTATTT 7500 
AACACA/^TCCTAAGATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 
90 lr at ATTflGC TTTIV^CATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAG AAAC 7620 

U ^aa^ca^IIa^a^ 7680 

^rA^^A^G^^^TA^G^CAG^TCCACACAGG AAGAAATTGA GGCTTATGGA TGGGGATG AC 7800 
SSSSES STC^C^GGATOTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 
?S fY^AnfiAATT TGAACATTTT DZTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 

5 Sgg^c^ 

aa a ATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
C^J^G J Jg^ISSgCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCT CCACCTGTGT I 1100 
Satp^A AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8160' 
^0 £pt^^^ TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 

3 ?c^gtgggS 

^caaaGG^CCAACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
™S^AnC^A^ A^CTAAGGC84O0 

SgggI^tcac^ 

^ ta a Krrr.cm TTTTCTCACT ACAGAGACAA TATTTTTCCA CCATTGTATC TCACA1 i l i 1 b^u 

35 SamtSat^ca^aagagactagagtggaacagat^ 

SaGCAA ACAACAGTAA GCTTAAAATT XCTTCATAGT^CATGTm ACGTTCACAA 8640 
ATrP4ii A TTTGC ATTC CACTTTCTGA fTi AUUU lluuuuiuiftAiAiviAw^.ft 

SaatatTOA^^^ 8760 
40 ^ISg^C^G^TCAG CTGGTCTTCA GCTTGACCCT AGAATTGACT CTAGGAGCAG 8820 
^ A^CTG^CT G^CTCCCAGA GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGACCTTCTC 8880 

™£gct^ 

SaGGCAOTtSgTTGGG TGCTGGAAGG AACCCATTAACTCACACTAG AATGATGAGG 9000 
AT^G^I^^CTGGCGTGGA GAAGGATGAG CCCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 

45 tg^ac/^agctc^w^tcagc agattcctga atgctaggct ggaaagtggt ^^^®][?^SL IfjL 

r a Arrrr AGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GG^SgI aSScSa OTtIcCcVa GCCACACATA ATTACTGGGT OCC^CAGCA 9240 
CCTAGCACCT TGGAGGGGGC ACTATTAGGA AATCGAGATT ACTATCGCAC ^TTAA^CC 9300 
SaAGGC ATGGGGTTGT GGTGGACAGA ^^^^^^^^SJJJ^^SS^^^^^-^q 9420 
^ A i t a r ath a AA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
^ U a r p ArTTTAG^ GCAGG A AA AC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
Ar A^^x^^AG^m^mA^ AAAATGGTCA CTTCTGTGAC ATAGCTGGAG 9540 
a a ArGA^GGG TAAC AGCTTG^ OCAG AGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 

^SaS?gItcca^cc^ 9660 
55 c^ac^ 

ArrrnT aha GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTG AG 9780 
rSrA^GGAG ^G^^G GCCOTATO TGTTACCTCT GCAGGAGTGA CTGCCCAACC 9840 

SSE^^G^^S^G^AA^ S 

65 ^c^ctagS 

iroOT^^rnTOA tgtatacatg tctagaagtg atagaatgca ^^mocimn 

mfiTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
nYTG^GT XG^GGG AGCT GTTCTAGCTC CAGTTTCTTC TGACACAim ^TTCAGGATCA ^10500 
TrPATCTOAT CCTCCG AAGC ACAGCAGAGA TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 
-7A ^S^t-^t T"iTTTTACn^ ACC^.GGG ATG GGCTTTTGCA TGGCACTGAC TATAGAGATG 10620 
70 iSSaK^ "680 
I^CTCATC ^GGGCatt CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 

^agSa^ctcacaatggtagcctccaaacct^^^ 

rAGGGGTGGT CCACTGGGGC AGATACAAGG G AAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 
7 S rTA^TKl^GAA G AGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAGAACT 10920 
PctSgGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
?SSgCOTGAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 11040 
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TTAATGTAGT TCATGGTTCC TTAGTGAGCA GGACTCTTGG ATGTGGAGGA GAAAGGGTCA U 100 
TAGGAAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT 1 1 160 
GGGCTTATGA ATACAAATCA GTGGGCCTTC TCTATGATGG TCCAACAAAC TCTCAGTGTC 11220 
CACCCTGTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 1 1280 
GCCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAGTGGTGT CATTCCATGT AAAGTTAGGC 1 1340 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 1 1400 
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 1 1460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT AAGCTGCCCA 1 1520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 1 1530 
AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGOATCATGT 1 1 640 
TTGAACTCAT TCATACCTGT AGAGCACGAA TCCAAGTAGA TTGTGTTTGG TCTGTACAGG 1 1700 
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 11760 
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 1 1820 
CTGCTTTGCC ATAGACCCAA TACCCATGTA GATCTCTCAT GAGAGCAGCC ATAACTCAGA 1 1 880 
CCCACTGACC AACAGGGCCA TGAGTGACAG CCAGAACCAG TGAAGGTCCA AGTAGGACAC 1 1940 
AGAGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT TAACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180 
CTAGCCAATC AAAGACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 12240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TGACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCTr TGCAGCCTCC TCCATGAATC 1 2360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT TTTTTTTTTC 12480 
TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCCTG 

Seq ID NO: 98 Protein sequence: 
Protein Accession »: NP_00898 1 . 1 

1 11 21 31 41 51 

GIREFNPPRO TEINTYROSl NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 
LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHCI DFHYYFSSRD RSSPGALNVY 180 
VKVNGGPQGN PVWNVSG VVT EGWVKAELAI STFWPHFYQV IFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HFLRLQNVEV NVGQNATFQC 1AGGKWSQHD KLWLQQWNGR DTALMVTRW 300 
NHRRf SATVS VADTAQRSVS KYRCVIRSDG GSG VSNYAEL I VKEPPTPIA PPELLA VGAT 360 
YLWDCPNANS IIGDGPHUC EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEIRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV EIVDIRARQL TLQWEPFGYA VTRCHSYNLT 480 
VQYQYVFNQQ QYEAEEVIQT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EELVVQTEED 540 
VPGAVPLESI QGGPFEEKIY IQWKPPNETN GVITLYEINY KAVGSLDPSA DLSSQRGKVF 600 
KLRNETHHLF VGLYPGTTYS FTIKASTAKG FGPPVTTRIA TKISAPSMPE YDTDTPLNET 660 
DTTITVMLKP AQSRG AP VSV YQL WKEERL QKSRRAADII ECFSVP VSYR NASSLDSLHY 720 
FAAELKPANL PVTQPFT VGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEKQVDNT VKMAG VIAGL LMFII1LLGV 840 
MLTIKRRRNA YSYSYYLSQR KLAKKQKETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 
SSSQDVNGFT DGSRGELSQP TLTIQTHPYR TCDPVEMSYP RDQFQLAJRV ADLLQHITQM 960 
KRGQGYGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1020 
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQENSA SJVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTUETEPL AEYVIRTFTV QKKG YHEIRE LRUHFTSWP DHGVPCYATG 1 140 
LLGFVRQVKF LNPPEAGPIV VHCSAG AGRT GCFIAIDTML DMAENEGWD IFNCVRELRA 1200 
QRVNLVQTEE QY VFVHDAIL EACLCGNTAI PVCEFRSLYY NISRLDPQTN SSQIKDEFQT 1 260 
LNIVTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320 
QPAAF WTQH PLPNTV ADFW RLVFDYNCSS WMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED HHRIFRJCN MARPQDGYRI VQHLQYIGWP AYRDTPPSKR SLLKVVRRLE 1440 
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NI1DVFH1VK TLRNNKSNMV 1 500 
ETLEQYKFVY EVALEYLSSF 

Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: NM JJ02988. 1 

Coding sequence: 71-340 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

COGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTO ACCAC TTCTCTGCCT 60 
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 
CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180 
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAG AGGCCG GCAG ATCTGT GCTGACCCCA ATAAG AAGTG 300 
GGTCCAG AAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 
CCACCCTGG A GGCCACCTCT TCTAAG AGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540 
ATTATGAAAT TAGTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600 
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TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTmGCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 
^ AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 100 Protein sequence: 
Protein Accession #: NP_002979. 1 

I 1! 21 31 41 51 

10 MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 

Seq ID NO: 101 DNA sequence 
15 Nucleic Acid Accession »: NMJH5507.2 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop coaons) 

1 U 21 31 41 51 

20 CCGCAGAGG A GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGO TGCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTGl 20 
CGA^GGAGCGG/^GACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT ACCTGCTACO 180 

g?g?ScgS 

ATorrTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 
25 G^^CG^ ™ 
IG^CIAIQQ^ACIKK^ GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTACATGCGAACC TGGATGTAAG TTTGGTG ACT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC cTcGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 
A^JcCCGGC CATGCCAACA CAGATGTGTG AATACACACG ^CTAC^AA GTGC^GC 600 
30 CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

J U Sctgtc agtacagctg tg a ag acaca gaagaagggc cacagtgcct ™™£cc 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTCCCTCT 780 
GGTAAAGTCATCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 
3 5 AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AG AGCACCTG^T ACC ATCAA AG ACAG AATC^ 1 08 0^ 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CA >^™* ^ 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 
40 ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

W gISotg aggatgagaa aagagaagag aaagccctga agaatga^ 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAO ^/f<^TT 440 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGG AACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAQAO/^QA TGATTTTOAC 1500 
45 TGGAATCCTG CTGATCGAGA TAATGCTATT ^^^^^^^^^PS^^JSSP^^^^^^} t*2Q 
GGTCACAAGA AAGACATTGG CCG ATTGAAA CTTCTCCTAC ^^^^^ ^ 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCG AGTG 1680 
TTTCTGAAAA ACAGTAACAA TGCCCTGGGA TGGGAGAAG A ^^^^1 Ar rAtCATTni 1 800 
-rnn a Ar.ArAfi OGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1 80O 

50 gaa^agaS 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGG ATGAC1GAATGTTACT ATCTTTATAT 1920 

SgScta^ot^ 

^AGAAT^CTAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT A AG ATG CCTT 204 0 
^GTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

55 SgaItc^S 

CAGTATATCT GATTTGTATA AGTAAGTTGATGAGCTTCTCTCTACAACATTTCTAGAAAA ^2220 
TAGAAAAAAA TgCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



60 



Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP_056322,2 



1 11 21 31 41 

65 i i I I I I 



MPLPWSLALP UXSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 
GKVTCPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 
70 GSFKCKCKOG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKDCNVT 300 
PRPTRTPTPK VNLOPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 
SLRGDVFFPK VNEAGEFGL1 LVQRKALTSK. LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGKJQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
75 SGLCPDSLLSVDD 

Seq ID NO: 103 DNA sequence 
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Nucleic Acid Accession #: NM_00t 565. 1 

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCACCATG A ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 1 20 
ATTCAAGG AG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 1 80 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAG A AGAGATGTCT GAATCCAGAA 300 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGG AA 540 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTGA GGTGCTATGT TCTTAGTGG A TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TT TGGGG TTT ATCAGAATTC 720 
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAG AT GTATATATTT CCTATATTGT 960 
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 

Scq ID NO: 104 Protein sequence: 
Protein Accession #: NP_00 1 556. 1 

1 11 21 31 41 51 

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCIS1SNQPV NPRSLEKLE1 IPASQFCPRV 60 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 

Nucleic Acid Accession #: NM_0 15068. 1 

Coding sequence: 1 1 70-2243 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GTAACAAOCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 
AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCGAACGAA GAAGGGACG A GCTCTCTGAA GAGATCAACA ACTTAAGAGA GAAGGTCATG 1 80 
AAGCAGTCGG AGGAGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AGAGGAGAAC 240 
ACCACCCTTC GAG AGCAAGT GG AACCCACC CCTG AGGATG AGGATGATGA CATCGAGCTC 300 
CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAGAGA AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TG ACCGGCCG TGCTGCCCGT TGGGCCTC AG CAAAGCTGG A GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TGAAGACCCT 600 
CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCGACTACT CCAATGCTTT CCAG ATGATT GCCCAGG ACC TGGATTGGAA CGAGCCTGCG 720 
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTGATT GGGCAGTGCA TTCACATTGA GAGAAGGCTG 840 
GCCAGGGCTG CTGCAGCTCG CAAGCCACCC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 
ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080 
CCGGCCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1 140 
ATGATGCCTC ATCTCCACAC TTGCAAGTGAJGCTCCAGAT TCATCTTCCG GGCAGACACA 1200 
CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA AAATGG AATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGG AAGCAA 1 320 
TTGATGGGCG CCCCATAGCA TCGGGCCCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 
CTATCGTCTT TGATTCTGAA TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1S60 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GGATACAGAG 1620 
TTTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCACACAGTA 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 
TGGCAAGAAA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1860 
TTCTCCAGGT GAAGAGGGGG TGG AAACTGC AAGTTTCTTA TGATTGCCG A GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTGAA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCGACCTAC CCAGTAGGATTCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 
AAGGAAGATC ACTATATGTA CCTGTGATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 2160 
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAA ATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
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TTTATTCCTG TTCAGCTTCT CAATCAGTGA CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340 
TCAGGCCACC TGAGGCACAT CCTCTCTGAA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCCTAAAGCA CCAAAAG ACC TTCAACATTT TCTGAGAGCA ACAGAGTATT 2460 
TGCCAATAAA TGATCTCTCA TTTTTCCACC TTG ACTGCCA ATCTAACTAA AATAATTAAT 2520 
AAGTTTACTT TCCAGCCAGT CCTGGAAGTC TGGGTTTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTAAATTA TAGGCTGCCA AATTTGCTGT TTAACATTTA CAG AG AAGCT GATACAAACG 2640 
CAGGAAATGC TGATTTCTTT ATGGAGGGGG AGACGAGGAG G AGGAGGACA TGACTTTTCT 2700 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTGATCCTCT CGCTGCACCC TTAGAAACTT 2820 
CACCGTCTTC AAACTCCATT TCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 
GGTTCTCCCA GG AGCAGGAA AAACCCTTGT GACATGAAAC ATCTCAGGCC TGAAAAGAAA 2940 
GTGCTCTCTC AGATGGACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACATGTACCC AATG ACTCCG GCTTTGACAC AACACCTTAC CATCATCATG CCATG ATGGC 3060 
TTCCACAAAG CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3120 
TTCATGAAAT GTGACCACCT CTCAATCACC TTTGAGGGCT AAAG AGTAGC ACATCAAAAG 31 1 80 
GACTCCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATG AGTGT TCTTAATTGG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
TGTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 
ATTAGGATGT GTGGAGAGAG GAGAAATGAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 
GAAGGGGGAC ATCTTTGAAG AAGAGGAATT AGGGCTGTGG ACACTGTCTT GAGGATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTAACCACT TCAA AAGGAT CAGAATCCAC 3540 
GTAATG AAAA AGGTCCCTCT AG AGGATGG A GCTG ATGTGA AGCTGCCAAT ' GGATGAAAAG 3600 
, CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAG^G TCTOAGCCC 3660 
AGTGACACCT CTGATGTCCC CTGGAAGCTT TGTGCTAACC TGGGACTGCC TGAOTCCTT 3720 
TAGCCTGGTC CCTTGCTACT ACCTTGAACT GTTTTATCTA ACCTCTCTTT TTCTGTTOA 3780 
TTCTTTGCTA CTGCCATTG A CCCTGCTGCA GGATTTGTGT ^^^^^^^l^ Qm 
G AGACTCCAT TTTGCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAAAGCA 3900 
CAGTTTGAGC AGTAGAA AAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAGAGGAG 3960 
TCTAACGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATCCGC 4020 
TCTTATTCTT G CATGUT A A ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTOAA 4080 
GTGGTGACTA GGCTTTCAAA AACTTCCATT GAATTACAAA GCACTATCCA GTTCTOTTG 4140 
TTAAACTAAG T AAA AATG AT AAGTAACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 
TACAATGCTG TG AATG AG AG GCTCCTCAGA ACTGG AGCAT TTGTATAATAJ^rTCATCCTG 4260 
TTCATCTTCA ATTTTAACAT CATATATAAT TTCAATTCTA TCAATTGGGC C ^^J, A A ^ an 
CATATAAAAG GATATAAAAT TTGAAAAGAG AAACCTAATT GGCTATTTAA TCCAAAACAA 4380 
CTTTTTTTTT TCCTTCAATG GAATC AG AAA GCTTGTCAAT CACTCATGTG TTTTAGAGTA 4440 
ATTACTTTTA AAATGGTGCA TTTGTGCTTC TG AACTATTT TGAAGAGTCA CTTCTGTTT A 4500 
CCTCAAGTAT CAATTCATCC TCCATACATT TGAATTCAAG TOTrrTTTu i<^iAA i w^j56u 
AGTTGTCAAT TGATCTTCAA GCTGCAGGGT ^^^^^^^^^^J^J ^^^^^^^^^^q 
GGCATGTGCA CACGGACATT TGCCACCACT GCAAGCAAAA GTCTGGAGAA GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GJCCCTG ATC. 4740 
CACATTTGGC TGTTTACTAA AGCTTGTGAT TAACTTTTTG GCAGTGTGTA CT ATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTCTA A ATTT ATT AT 
TCTATAGTTT TGAAGTTTGG TTA AGTTTCC TTTCACTCAA TTGATTTATT TTGTTGTTAA 4920 
TCAAATTTAT GTTAATTGG A TCCTTTAAAT TTTTTTTGGC ATTTTCCAAC AAAAATGGCT 4980 
E^KI GAAAG?A^ AAATCAATGG AATTTGATAT CTAAAGAAGT TAGAAAGGGA 5040 
GCAAAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTGACTTT TAGCCCAAAT TTACATTGTT AATTAAATCA 5160 
AGAAGGAAGA AGATCTAAGA GCTCCCATTG ATAGGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 
TTATCATGCT AGGATATTG A AACACAGAAA GTTTACATAC ATTTATGAAG GGTCAATTTA 5280 
GTTTGGACAG TGAGGTATTT GTCTTAGTGG AAAAAAGGAG AATTAGTCTC ^CAAATGGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCC ACATCW ^TATCGTCCAG 5400 
TCTGGAATTC TGTTTAAGTT TGTAGGTACC TCTTGGACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAG ATACAG ACAGTTCCAA G *™ A ^™^C*£ ^20 
CTGCTGGAAA GACCTGGGCA GAAATGGAGA GCCCTGCGGG ^ ( ^^^TI^S : i ^ 
TAAAGAGAGA ATGCACATCT GATG AG ACTG AAAGTTCTTT GTTGTTTTAG ATTGTAGAAT ^ 5640 
GGTATTG AAT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCTTTGTGTA ATCAAGTTT^700 
AGTAATAGGG G ATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATTAAGT^AAT^TTT 5760 
AAGTGGGTGG GGTTATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 
GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 
TAAACTCGGA GGGAACTGCA GGG AGACCAA CTTATTTAGA GCG^™GA CATGGATAAA 5940 
AACCCCAGTG GGAGAAAGTT CAAAGGTGAT TAGATTAATA ATTTAATAGA WATCAGTGA 6000 
CCTCTGATAA ATTACTGCTA GAATGAACTT GTCAATGATG GATGGTAAAT mCATGGAA 6060 
GTTATAAAAG TG ATA AATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 
CACTGAACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 61 80 
ATA^rreCTCT^ CTATUGO'Ai^ATTGCATT GCTGTTACAA TAAAAATTCA ATAAAGATTT 6240 
AGTGGTTAAG TGC 

Scq ID NO: 106 Protein sequence: 
Protein Accession*: NPJ)55883.1 

I 11 21 31 41 51 

MTERRRDELS EEmNLREKV MKQSEENNNL QSQVQKLTEE NTTLREQVE? TPED^DDIE 60 
LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 120 
^^f^ RWASAKLERS HYLMHNYPAF MMEMKHVFED PQRREVAKRK IRRLRQGMGS 180 
^SqMIAQDLDWNEP AL1DQYHEGL SDHIQEELSH LEVAKSLSAL IGOCIHIERR 240 
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LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGGARMR LTQEEKERRR KLNLCLYCGT 300 
GGHYADNCPA KASKSSPAGN SPAPL 

Seq ID NO: 107 DNA sequence 
5 Nucleic Acid Accession ft NM_003679.1 

Coding sequence: 47- I507(underlined sequences conespond to start and stop codons) 

1 11 21 31 41 51 

I U GGCACGAGCA G AAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATGG ACTCATCTGT 60 
CATTCAAAGG AAAAAAGTAG CTGTCATTGG TGGTCGCTTG GTTGGCTCAT TACAAGCATG 120 
CTTTCTTGCA AAGAGGAATT TCCAGATTGA TGTATATGAA GCTAGGGAAG ATACTCGAGT 180 
GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240 
CTTGAAAGCT GTTGGCCTGG AAGATCAGAT TGTATCCCAA GGTATTCCCA TGAG AGCAAG 300 

1 5 AATGATCCAC TCTCTTTCAG GAAA AAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTGAGAAATA 420 
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 480 
GATCACAGTG CTTGG ATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540 
ATGTGATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 

20 CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTOCACCTA AG AACGGAGA 660 
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATGAACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 780 
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAGAAAC TCCTAGTGCA AG ATTTCTTC CTGTTGCCTG CCCAGCCCAT 900 

25 GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1 140 

_ _ AGAGATGCGA GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1200 

30 TCTTCATGCG ATTATGCGAT CGACCTTTAT CCCTCTCTAT ACAATGGTCA CTTTTTCCAG 1260 

AATAAGATAC CATG AGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440 

„ c TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1500 

35 CAGGTGATAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC AAAATTAAGC 1560 
ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GGAAGATAGT GTTCTGCTTA 1620 
TAATTAAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1 680 
ATTTTAAAAG ATG AAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1 740 
AACTATAAAA GTGCAATGAC TAAG ATCCTT CACTTCTCTG AAAGTAAGGC CCTAGATGCC 1 800 

40 TCAGGG AAG A CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGG ACTC GCAACAGCAT 1 860 
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTTCAT 1920 
TACGTGAATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTC AA ATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCT ATCTAT 2 1 00 

45 CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2 1 60 
TCTATTTATT TATGTATTTA GAGATCAGGT CTCACTCTGT TGACCAGGCT GGAGTGCAGT 2220 
GGTGAG ATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA GAGACCGGGT CTTGCCATGC 2340 
TGCCCAGCCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400 

5 0 TACAGGG ATT AG AGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT AAAGATAAGA 2460 
AAAAGGCTAT TAATATCATA CTAAGTG AAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580 
GCCATTACTT TATTCAATTT CAG ACCCTCA GAAGCAATTT ACTAATTTAT TCTTCGACTA 2640 

r CATACTGCAG CAG AACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TG A AATGTTT 2700 

5 5 TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AG AAATCCTT CAAACATGAT 2760 

TAATTATGAA GATGAAACAC TAGAGTCATA TAAGAAATAA AAATTGGGCA ATAAAATAAA 2820 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TG AGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGGAAAGAAA A AATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 

60 TATATGTAAT ATATATGTGG G AAATACAGG GG AATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAATTTTTAG AGAAACAACA 3120 
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180 
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG 3240 
TCTCTGAAACTCCTCTTTGT CATACTGCAG ATATAAAACG TCTTTTGTTT CCAACAAGAG 3300 

65 GATTTCTTTT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 3360 
CTGAATAGAG TCCAACAGTA CAAAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCG ACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATTTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540 
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600 

70 ATTAAAAG AA TTTGTTTTTG TTCAACCTCT TCCTG AGGCC CAAG AGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGG ACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TGAA AGTCCA A AAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGGAC G AACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
GCTGTCGTCA ACTG ACAGTG ATTCACCACT GGTGATGATA AAAATGATGG AAG AAGAGTT 3900 

75 GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAGAAAAAAT AAAGTGGAAT 4080 
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ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140 
TAAGTOmmGrrTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACCTAAATC ACCCCAGAGC TTTATGTCTT TTATTCATTC TAAATCTTAT TAACCGG AAT 4260 
I^G^S ATTOAATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
GAGGGCAGGT A^CT^^ AACATTAAGT TAGTCCACAA ATATTCAGTG 4380 

GGCATCTACT AGGTGACAGC CACTGTGCTA TAATTAG AG A CTTTTTACTA TAAGCATCAA 4440 
AAACA™ 

AA^AGGT^AGAATTITTTTTTTCTATCG AAATTACTAATCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGC ACTAA.TG TCATC A AC A A ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620 
CATCGA^TAATATr^TTCG CC^ GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 

CALUjAAIAA ^^^'^X* a An a at a atatpapaa TGG AAGGAAC CTAGATTCCT 474C 



rATCGAATAATATTTCTTGG CCTCCTTTCC GClil- ICv, ivJ i^au^ivji iv. uiv,iuiavaa „ 
GAAC^GGA GGCCAACGCC TAAAGATCAT AATATCACAA TGG AAGGAAC CTAGATTCCT 4740 
A^TGACTGC ATAGG ACAG A TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATCATG AC CAGTAAA ACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920 
ATTAmAdAG^CGAAATAAATGTTGAATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
TAAAAATATA ATAAATAGCT 



Seq ID NO: 108 Protein sequence: 
Protein Accession #: NP 003670. 1 

21 31 



MDSSVIQRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60 
R^AuSwG LEDQIVSQGI PMRARMIHSL SGKKSAIPYG TKSQYILSVS RENLNKDLLT 120 
AAEKYPNVKM l^^lHRLLiCCN PEEGMITVLG SDKVPKDVTC DUVGCDGAY STVT^HLMKK .180 
P^DYSOOY^ 240 

Sits ^ ™ffqky fpdatplige kllvgdffll paqpmisvkc s™^v 300 

LLGDAAHAIV PFFGOGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRLRI PDDHAISDLS 360 

VTNKGLFFLG SLIAISSTYL LIHYMSPRSF LCLRRPWNWI AHFRNTTCFP AKAVDSLEQI 480 
SNLISR 

Seq ID NO: 109 DNA sequence 

Nucleic Acid Accession #: NMJW6115.1 

Coding sequence: 236-1765 (underlined sequences correspond to start and stop codons) 



21 31 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 
cSac^ccc CACCCGCTTC CCAGGCGTGA c^ gtca acag^ no 
ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 180 
GAGACCTAGA AATCCAAGCG TTGG AGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240 
AOG^AAGGCGT ^GTGGGGTT CCATTCAGAG CCGATACATC ^^^^PJ^^^I^3^^P T i^. 
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360 
TG^CGCCCTC GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA ^^^^^^^^^^^^^ r 
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480 
TCTGCGAGTG CTGATGAAGG G ACAACATCT TCACCTGGAG ACCTTC AA AG CTGTG CTTGA 540 

Igg^cttgatgtgctccttg cccaggaggttcgccccagg aggtgga^cttcaagtg^ct 600 

GGATTTACGG AAGAACTCTC ATCAGGACrTCTGGACTGTATGGTCTGGAA ACAGGGO^G 660 
TCTGTACTCA TTTCCAGAGC CAG AAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAJ3TAGA 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG JAGACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATG AATTGTT CTCCTACCTC AJ^P^^^^^^^^^pp^pp^ "° 
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGG A 900 
TATC^\GATG ATCCTGAAAA TGGTGCAGCT GG ACTCTATT GAAGATTTGG AAGTGACTTG 960 
TACCTGGAAG CTACCCACCT TGGCG AAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATC ScCAOTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1 140 
5otggactct™™cc TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
JScTTGGAA ACCCTCTCA^ TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 
GrcScAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 

Staagtccc^ 

r^GGTCTTT G ATG \GTGTG GG ATCACGGA TG ATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 
Sa^^CT^AGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCTCCATATCTGC 1500 
C^TOCACAGT CTCCTGCAGC ACCTCATOGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 
T^CCCCCTGG^IS l «° n 
TCTGC ATG CC AGG CTC AGG G AGTTGCTGTG TGAGTTGGGG CGGCCCAGCATGGTCTGGCT 1680 
T A^TCCC A. AC CCCTGTCCrC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 
r TGCCCCTGT T FcaTGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 
^GACActI AAGCCAGGAl G TGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 
AC^AATGTTC AGTGTG AGTG AGG AAA ACAT GTTCAGTG AG GAAAAAACAT ^AGACAAAT 1920 
GTO^GTG AG GAAAAAAAGG GG A AGTTGGG GATAGGCAG A TGTTGACTTG AGGAGTTAAT 1980 
GTC^CTTTC ^3G^G ATACA TCTTATAGAG TTAGAAATAG AATCTGAA^ TJCTAAAGGGA 2040 
GATTCTCGCT TGGG AAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAG AGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 

Seq ID NO: 1 >f> Prniem sequence: 
Protein Accession #: NP_006 1 06. 1 
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1 11 21 31 41 51 
I I I I I I 

MERRRLWGSI QSRY1SMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60 
FDGRHSQTLK AMVQA WPFTC LPLG VLMKGQ HLHLETFKAV LDGLDVLLAQ EVRPRRWKLQ ! 20 
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDGLSTEAEQ PFIPVEVLVD 1 80 
LFLKEGACDE LFSYLIEKVK RKKNVLRLCC KKLKIFAMPM QDIKM1LKMV QLDSIEDLEV 240 
TCTWKLPTLA KFSPYLGQMI NLRRLLLSH1 HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300 
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 
LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNSISI 420 
SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP ILCPCFMPN 



Scq ID NO: 1 1 1 DNA sequence 

Nucleic Acid Accession U: NM JW38 15 

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CGCTGCCATG CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCCCCGAGG GAGCCCTTGG AGCCCCAGGT CCTTCAGGAC GATCTCCCAA TTAGCCTCAA 180 
AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTGAGGATC AAGTTGGAGC TGGACGGTGA 240 
CAGTCATATC CTGGAGCTGC TACAGAATAG GGAGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAGAGGC TTGGTGGTCC TGACCCCAGA GAGAAGCTAT ACCCTGGAGC AGGGGCCTGG 480 
GGACCTTCAG GGTCCTCCCA TTATTTCGCG AATCCAAGAT CTCCACCTGC CAGGCCACAC 540 
CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGGA GGCGGGATGT GGTAACAGAG ACCAAGACTG TGGAGTTGGT 660 
GATTGTGGCT GATCACTCGG AGGCCCAG AA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGGACAC ATTCTTCCGG CCCCTGAATG TACGAGTGGC 780 
ACTAGTGGGC CTGGAGGCCT GG ACCCAGCG TG ACCTGGTG G AGATCAGCC CAAACCCAGC 840 
TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 1080 
TGATTTGCCT GGG AATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1 140 
GGAGGCCTCC ACAGACTTCC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGCCCT 1 200 
GGAGAAAGCC CTCCTGGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCGAGCAGT GTGACTGTGO 1320 
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TGATTCTTTG ACCTGCCAGC TGAGGCCAGG 1 380 
TGCACAGTGT GCATCTGACG GACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 
GCAGTGTCGT CCTACCAGAG GGGATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1500 
CCAGTGTCCC CCTGATGTCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCG GGCAAGCTGT 1560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGGACCTGG 1620 
AGCCCAGCCC GCTGCGCCAC TTTGCCTCCA GACAGCTAAT ACTCGGGGAA ATGCTTTTGG 1680 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTG 1740 
TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TGAATGGG AC TGAGCTGAAC TGCAGCTGGG TGCACCTGGA 1860 
CCTGGGCAGT GATGTGGCCC AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGGTGTGT ATAGACCATC GATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 
CTGGGCACCC CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTGA CCACAGGGCT 2100 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTGATGCTT GGTGCCGGCT ACTGGTACCG 21 60 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGGAAG CCACTGCCTG CCGACCCCCA 2340 
GGGCCGGTGC CCATCGGGTG ACCTGCCCGG CCCAGGGGCT GGAATCCCGC CCCTAGTGGT 2400 
ACCCTCCAGA CCAGCGCCAC CGCCTCCGAC AGTGTCCTCG CTCTACCT CT GAC CTCTCCG 2460 
GAGGTTCCGC TGCCTCCAAG CCGG ACTTAG GGCTTCAAGA GGCGGGCGTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 
ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGGA 2640 
GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 

SeqIDNO: 112 Protein sequence: 
Protein Accession U: NP_003 806.2 

I II 21 31 41 51 
I I I I I I 

MRLALLWALG LLGAGSPLPS WPLPN1GGTE EQQAESEKAP REPLEPQVLQ DDLPISLKKV 60 
LQTSLPEPLR IKLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRWSE GHTLENCCYQ 120 
GRVRGYAGSW VS1CTCSGLR GLVVLTPERS YTLEQGPGDL QGPPUSR1Q DLHLPGHTCA 1 80 
LSWRESVHTQ TPPEHPLGQR HIRRRRDWT ETKTVELVTV ADHSEAQKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
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AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL G^JHDL 360 
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420 

™cg>^ep^^^^ ™ 

RPTRGECDLP EFCPGDSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540 
5 P^SSt^ n¥r^^ LQCQTGRTQP LLGS!RDLLW 600 

ETOVNGTCL NCSWVHLDLG SDVAQPLLTL PGTACGPGLV CIDHRCQRVD LLGAQECB5K 660 

SvSrIcyceegwa ppdcVqlkatsslttglll ^p^^p^so 72 

LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADPQGR 780 
CPSGDLPGPG AGIPPLVVPS RPAPPPPTVS SLYL 

10 

Seq ID NO: 113 DNA sequence 

Nucleic Acid Accession #: NMJW2416 

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons) 
15 1 II 21 31 41 51 

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTATGAAGAAAAG TGGTGTTCTT 60 
^CCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTA^AGA 120 
YlWrrrc^^n^C^GCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 180 

20 ^SSTccaMGcS 

ACCACTK^C^ CAA^A^GTAT "DCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 4 8 
25 TOC^GGAGGATGGCATA T A AT AC A A AG GCTTATTAAT TIG ACT AG AA AATTTAAAAC 540 
a^ctct^Staact AAAGTTAGAA AGTTGATTTT ^j^J^^J^^ 
^TTAAAGG CTATG ATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 
SaIgG^CA? G^TTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 
CTCACAACAG CTGCCTGG/^ G AGCAG CCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 
Id TATCTCAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 
TG A^ATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 
S^£cCT CACACACAAT GTGTCTG AGA GATTCATGCT GATTGTTATT GGGTATCACC 960 

TOCATCTTG i CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TG CTTCATTC 1 080 
1 S lA^Arr^r TT^^ATCC TACCACA ATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1 140 
35 r^SS^ScTG ISctStTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 

40 1440 

U r?A^^AC TGTGGA^CTA GGCTTTAATA aVtITTTAAT TGATGTTGTT ATGGGCAGGA 1500 
S^T^Ar ACrxT^TC CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 
SaGC^TG CT^5^A^^^Sg ACAC TCACTACAGG GACCAGGGAT 1620 
StoK oi^SSIcAGO ATGTTTGCTC f™™* ^ 
4^ fif ACGTGGTA AAACACTTGC GOATATTCTC GACTGTTTTT AAAAAATATA CAGTTTAOCG 17W 
A A AATrATAT AATCrrACAA TGAAAAGOAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 
rrA^rrATAC A^A/^CCTT^CrcGAAG GAAAAGGGCTTTCTCAATAAGCCTCAGCTT 1860 
TCTAAGATCT AA*^AGATAG CO^CCG^UJAT ^ ( ^^^JP^p^jS^^^Jj^^^^^^' < |980 

S<5 ESSSSa GTGTTOAlAlAmGAG^G TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 

TCATTTATCA TATATATACA TACATGCATA CACTCTCA AA GCAAA^A^ TTT^CTTCA 2WJ 
AAACAOTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 

tatpA ATA A A TAf5 AfTlATTA ATCAG 



60 



TATCAATAAA TAGACCATTA ATCAG 

Seq ID NO: 1 14 Protein sequence: 
Protein Accession #: NPJ002407 



I II 21 31 41 51 
65 LrKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60 

QKKTT 

70 Seq ID NO: 1 15 DNA sequence 

Nucleic Acid Accessions: NM 003238.1 

Coding sequence: 182-1426 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 
75 L a rf AG GAT ACGTTTTTCT GTTGGGCATT GACTAGATTG TTTGCAAAAG TTTCGCATCA 60 
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TTGATTTCTT TTTTTTTATT CTGACTTTTA AAAACAACTT TTTTTTCCAC TTTTTTAAAA 1 80 
AATGCACTAC TGTGTGCTGA GCGCTTTTCT GATCCTGCAT CTGGTCACGG TCGCGCTCAG 240 
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGTTCATG CGCAAGAGGA TCGAGGCGAT 300 
CCGCGGGCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
CGAGGAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420 
GAAGGCGAGC CGG AGGGCGG CCGCCTGCGA GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CAAGGAGGTT TACAAAATAG ACATGCCGCC CTTCTTCCCC TCCGAAAATG CCATCCCGCC 540 
CACTTTCTAC AG ACCCTACT TCAG AATTGT TCG ATTTGAC GTCTCAGCAA TGGAG AAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAG 660 
AOTGCCTGAA CAACGG ATTG AGCTATATCA GATTCTCAAG TCCAAAGATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAGTTGT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTG ATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAGAACTAG AAGCAAGATT TGGAG GT ATT GATGGCACCT CCACATATAC 960 
CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGGAA 1080 
GAAGCGTGCT TTGGATGCGG CCTATTGCTT TAGAAATGTG CAGGATAATT GCTGCCTACG 1 140 
TCCACTTTAC ATTGATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200 
GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 
CAGCAGGGTC CTGAGCTTAT ATAATACCAT AAATCCAGAA GCATCTGCTT CTCCTTGCTG 1320 
GGTGTCCCAA GATTTAG AAC CTCTAACCAT TCTCTACTAC ATTG GCA AAA CACCCAAGAT 1380 
TGAACAGCTT TCTAATATGA TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGGAAAAG 1440 
TGGCAAGACC AAAATG ACAA TG ATGATG AT AATGATGATG ACGACGACAA CGATGATGCT 1 500 
TGTAACAAGA AAACATAAG A GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1 560 
GGTACTAGTT CAGACACTTT GG A AGTTTGT GTTCTGTTTG TTAAAACTGG CATCTGACAC 1620 
AAAAAAAGTT GAAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCA AATTTTT TTAAA 

SeqIDNO: 116 Protein sequence: 
Protein Accession #: NPJJ03229. 1 

1 11 21 31 41 51 
J I I I I I 

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQILSKLKL TSPPEDYPEP 60 
EEVPPEV1SI YNSTRDLLQE KASRRAAACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 
TFYRP YFR1 V RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRIELYQ ILKSKDLTSP 1 80 
TQRYIDSKW KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNY1IP 240 
NKSEELEARF AGIDGTSTYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 
KRALDAAYCF RNVQDMCCLR PLYIDFKRDL GWKWIHEPKG YNANFCAGAC PYLWSSDTQH 360 
SRVLSLYNTI NPEASASPCC VSQDLEPLTI LYYIGKTPKI EQLSNMIVKS CKCS 

Scq ID NO: 1 17 DNA sequence 

Nucleic Acid Accession #: NM JW0095. 1 

Coding sequence: 26-2299 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

I I I I I I 

CAGCACCCAG CTCCCCGCCA CCCCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGCGCGT CCGG ACAGGG CCAG AGCCCG TTGGGCTCAG ACCTGGGCCC 1 20 
GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGG ACTGGCT 1 80 
GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 
CGCCCACCCC TGCTTCCCCC G AGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCG A 480 
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGOTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 
CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 
CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 
AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTG ACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAQ GACAACTGCC CGCTGGTGCG 1020 
GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 
GTCCCAGAAG AACGACGACC AAAAGG ACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1 140 
CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGG GGAT GCCTGTGACA ACTGTCCCCA 1260 
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACQACTTT GTGGGAGATG CTTGTGACAG 1320 
CGATCAAG AC CAGG ATGGAG ACCG ACATCA GGACTCTCGQ GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTG CCTA ACCCCGGCCA 1500 
GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 
GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 
GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1 800 
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10 



15 



GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC | ^CTGCAflCT J™^™"™ 
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAACCCCTTOCGTG ^152^^??^ 
GPCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTIGGTTC CTGCAGCACC GGCCCCAAGT 2100 
GGGCTACATC AGGGTGCG AT TCTATGAGGG CCCTGAGCTG GTGGCCGACA f^™™™ }™ 
pttgp. APACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 
C^CTGGGCCAAcS 

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATG ACAGC CACCTTCACC 2340 
GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

Seq ID NO: US Protein sequence: 
Protein Accession U: NPJJ000S6. 1 

1 11 21 31 41 51 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 
FmJrVMECD ACGMQQSV RTG LPS VRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

90 ng^^^CNA^ 180 
S^GQ^C^^^ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 

AT^ERbGS R^VCRVGWA GNGIICGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 
HWdIiDG^GD ACDPDADGDG VTNE103NCPL VRNPDQRNTD EDKWGDACDN CRSXJKNDDQK 360 
DTDQD^G^M^D^DIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQ™^^ 

25 VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD ^^ D !^P9Y^? , 
RDNCRLWW GQEDADRDGV GDVCQDDFDA 0KWDKTOVC PENAEVTLTD FRAFQTVVLD 540 
reG^AOmPN WVLNOGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 
FGYODSS^FY WM Wl^ffiQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 

30 RLGVFCFSQE N1IWANLRYR CKDTIPEDYE THQLRQA 
35 



Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession U: NMJM42U 

Coding sequence: 157-1479 {underlined sequences correspond to start and stop codons) 



11 21 



GGG ACAGGGC TGAGGATGAG GAGAACCCTG GGG ACCCAGA AGACCGTGCC TTGCCCGGAA 60 
rTCCTCCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120 

TOGTGTGTCTGAGTCTCTT CACTGAGAGG ATGTGCATCX = AOMQACra ™ 
r^rmrr.GCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 

ATO^AAGTA TCTCTAGCAT TTCAG AG AGT AACATGG ACT M^^^™££? T ™ 

4 5 PG ACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAG AGCTT CACTCTCGAT 480 

^ GC^CGCCTCG TGGAGTTCCT CTGGGTGCCA ^^^3^^^ TrTT^Tn^A TCGCAttGTC m 
TTTCTCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600 
^T^GCCC TCAGAATCAC GACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 
SggIcACAC ^S^l^ OTTCCAGCro GAAAGCTGGG GCTATGATGG AAATGATCTG 720 

50 gagt^acctggS ™ 

CAGTACACCA TAG AGCGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGGA OACAGGAAAT 840 
TAC^ACTAGAT TGGTCTTAC A GTTTGAGCTT CGGAGGAATG TTCTGTATTT CATTTTGGAA 900 
Tr^ArGTTC PTTCC AXTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 

5 S at?ItcgGGT CCCGCACTTC TCTYca:AAC ACCAACTGCT TCATCAAGGC CATCGATGTG 1080 
5 tI^ctSggI mG^AG CTTTGTC TTT GGGG CCTTGC TAGAATATGC AGTTGCTCAC 1 140 

?ACA?^ 

GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCTTTAAACG^^ 

arc AGP ATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 
60 rAC^AGTOA ACTTTGTCTT CCGAGAAAAG ATGGGCAGG A TTGTTGATTA TTTCACAATT 1380 
60 ^^^^S^^KTTOC AAACTACrCjJ" T^OCTTTG AT ^IIX^EHSX^ ™ 

GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTOAGJTCAATO 

T^npATArr.T pttpa apaGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
65 ATC AAGCTCC AACCATTGTT CT A AGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 16 o 

^^^^ 

TATAPiTrtTT TTTTAPTAAA TCTCTGCAGT GCTTATAAAATACATTG TiO CCl Ai l iauu m» 
^S^STCCA^^TOOCmATTCA^TAGTOATOGGOTACTA^ 
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AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGG AGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAGGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC T GGTC TGCTC 2640 
TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700 
AACAATGAG A CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCCATCT 2880 
TTAATTT C TT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940 
ACCAAGGCTC TAAAAGATG A TTTCCCTTCT GTAACTCCCT AG AG CC AC AG GTTCTCAT TC 3000 
CTTTTCCCAT TATACTTCTC ACAATTCAGT TTCTATGAGT TTGATCACCT GATTTTTTTA 3060 
ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAGATGA AATGTGGTTG 3120 ■ 
TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 
AACCCCACTT AAGCATTGTT TTTATATAA A AACAATGATA AAGATGTGAA CTGTG AAATA 3240 
AATATACCAT ATTAGCTACC CACC 

Seq ID NO: 120 Protein sequence: 
Protein Accession ff: NP_0J5026. 1 

I 11 21 31 41 51 

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAG YNK FLRPNFGGEP 60 
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLW VPDT 1 20 
YIVESKKSFL HEVTVGNRLI RLFSNGTVLY ALRJTTTVAC NMDLSKYPMD TQTCKLQLES 180 
WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240 
NVLYFILETY VPSTFLVVLS WVSFW1SLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300 
CFIKAIDVYL GICFSFVFGA LLEYAVAHYS SLQOMAAKDR GTTKEVEEVS ITNIINSSIS 360 
SFKRKISFAS IEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSK.L 420 
LFPLIFMLAN VFYWAYYMYF 

Seq ID NO: 121 DNA sequence 

Nucleic Acid Accession *: NM_001 854 

Coding sequence: 163-5582 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

AACCATCAAA TTTAG AAGAA AAAGCCCTTT OACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGGA GATGGAGCCG TGGTCCTCTA 180 
GGTGGAAAAC GAAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGG AGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 
CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TG AGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTGA AGACCACACT GGAAAACCTG 600 
CCCCAG AAG A CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660 
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAG AAAACC A 720 
CGAAACCACT TGATAGAAGT GAG AG AGCA A TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 
G AACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGGACAT TCAGCAGTTT TTGATCACAG 840 
GTGATCCCAA GGCAGCATAT G ACTACTGTG AGCATTATAG TCCAG ACTGT GACTCTTCAG 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960 
TCGAATATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG . 1020 
GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 
AAGAATACAA CTATGGAACA ATGG AAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1 140 
GGACAAATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAGAGGAAA AATTCTGAGG ATACACTATA TOAAA ACAAA GAAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 
CAGAAACTGA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1 680 
CAACCATCTC TGCTCAGG AA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGG ATTGCTC 1740 
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 
GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 
CAATTGGTCC TCCTGGTG AA AAAGGACCAC A AGGAAA ACC AGGACTTGCT GG ACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
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pa r ATfiriYrr r Afi AGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAG AT GGTTTTCCAG 2520 

ScGC^ATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCCTCAAGGACCT CAGGGTCCAG 2940 
TT^TA^n^^TGC^CCAAAA GGCCCTCCTG GACCACCAGG AAGG ATGGGC TGCCCAGGAC 3000 
Y^r^rrr I ^nr^QGMSA^^nC AAGGCAAG AC CGGCCCTCCT GGGCCAGGGG 3060 
G A^TGGTTGG ACCACAGGGA CCA^CC^GTG AGACTGGTCC AATAGGGGA A CGTGGGTATC 3 120 

^i^^A^^^CT^QGtMCt CAGGG AAAGA TGGACCAGCA GGATTACGTG 3240 

GT^C^GGG^ 1™ 
^rrrrrrCA G GG€CC ACCA, GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 

c^gSggcc^ 

r Ar 7rxA AC 'C ^G^CCTGGAGAAAAAGGIC CCCAAGGQCC TGCAGGGAG A GATGGAGTTC 3480 
A^££kl ?TCG?CT^ GG^C^Cra GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 

nn^r^rr^rrrCAGGT CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 

^T9,9^P^IHS t^^ggtt^CCTGGAGATC CTGGTCCTCC TGGGG AACTT GGCCCTGCAG 4 1 40 
SS"Sa« AAGATGGAGATCCTGGTCAA C^GGTCCTC 4200 

a TrrTYTTrn G^G^ATCCCT GGTCCTGTGG GAG AACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
a^S^A^ CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

OT^mA TSS^ , AAATCTGAGG G AGTAAGAAT TTCATCATGG CCAAAGGAGA 5160 
AArtfiA AATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCXTCTGCTC 5280 

ggcaaaattt ca^ct accac tgtcatcagt cagcagcctg ^T A Jp^I!i^,'^^^^^^ $£oo 

GTTATCACAAAGCACTTCGCTTCCTGGGATCAAA^^^ 

ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGG AAACAG ^^^S^^^q 
!2l~r~ A An r n TrrrrTTrrr ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAUAAA 58»u 

JcaS^cgtgt^ 

GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAA01UA, euo 
ZlL^^-™-^ -r a atv- a t/~ a a ATrrrTTTnT GTGTGTTT 



TTTCTTTGGT TAATG ATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO: 122 Protein sequence: 
Protein Accession U: NPJK)1 845 



AySkGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 
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GADGLPGPPG TMLMLPFRYG GDGSKGPTtS AQEAQAQA1L QQARIALRGP PGPMGLTGRP 540 
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGP1GP PGEKGPQGKP 720 
5 GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKG ADGV RGLKGSKGEK 780 
GEDGFPGFICG DMGLKGDRGE VGQ1GPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 
GLPGYPGRQG PKGSTGFPGF PG ANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGWG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 

1 0 GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 
GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK. 1140 
GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 

1 5 GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAG A EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 
GUCGDPGSKG EKGHPGUGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGMEEIF GSLNSUCQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 

20 EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 

LSYLDVEGNS INMVQMTFLK. LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFXGSNDE 1740 
EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG 

25 SeqID NO: 123 DNA sequence 

Nucleic Acid Accession th NM_01 5886 

Coding sequence: 485-1261 (underlined sequences correspond to start and stop codons) 
t 11 21 31 41 51 

30 i i i i I I 

GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAGAC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTGAA TTTCAATTGC 120 
TGAAAATTTC CCACTGA A AA TATGCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 1 80 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 
3 5 GTTCCTCTTA GGGGTCTGCC TCATG AC ACT AAAAAATGA A TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGTTT GGATTTTTGA AGAAAGTCAA ATCAAGCAAT 360 
GCTCCCAAAT GATGTCTTTG TAAATTC ATA CCCTCTGGCC CTA 1111111 TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 
40 AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTG AAAG CACAATTAG A TTCAGCGG AT ATCCCCAA AG CCAGGCGGAA 660 
GCGCTACATT TCGCAGAATG ACATGATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGG ATGAAA ATCTTGCAAA 780 
ATCGGCAGAG GCTTGGGCGG CTACTTGCAT TTGGGACCAT GGACCTTCTT ACTTACTGAG 840 
45 ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCCAGTTGGT 900 
CAAGCCATGG TATG ATGAAG TG AAAG ATTA TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGGATAGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCGA CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1140 
50 AGAAGCACCA TATAAAGTAG GGGTACCATG TTCATCTTGT CCTCCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTAT ATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCITGTrTTCXrTCTT 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGGAATTC 



Scq ID NO: 124 Protein sequence: 
protein Accession H: NP_056970. 1 



1 11 21 31 41 51 

60 1 I I 1 I I 

MIAISAVSSA LLFSLLCEAS TVVLLNSTDS SPPTNNFTD1 EAALKAQLDS ADIPKARRKR 60 
YISQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQMVWAT 1 80 
SNRIGCAIHA CQNMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVG VPCS SCPPSYGGSC 240 

65 TDNLCFPG VT SNYLYWFK 

Seq ID NO: 125 DNA sequence 
Nucleic Acid Accession U: NM_00 1 793 

70 Coding sequence: 54-2543 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 

GCGGAACACC GGCCCG CCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 
7 5 TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 1 20 

CCTCCGAGCC GTGCCGGGCG GTC1TCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 1 80 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 
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CAGCTCTGTT TAGCACTGAT ^^^^^^ c ^^^^^^^^^^^^f^^^^Pr *Z 
AAAGAAGGTC ACTGAAGGAA AGGAATCCAT ^^^^^^^^l^S, 3 * 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 
CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 
A^StCAC ^GGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 

TCTTTGGCCA CGCTGTGTCA G AG AATGGTG CCTCAGTGG A GGACCCCATG A^ATCTCXA 660 

tcaTcgtgac cgaccag aat gaccacaagc ccaagtttac ^^aggacatc TTCCGAGGGA 720 

GTGTCTTAGA GGG AGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGOC 780 
ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGAC^ACGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGT^cYgGACCGGGAA AAAGTCCCTG AGTACACACTGACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT G ATCCCAATG 1 020 
ACAATGCTCC CATCTTTG AC CCCCAGAAGT ACG AGGCCCA TGTGCCTG AG AATGCAGTGG 1080 
GCCATGAGGTGCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC U40 
CTGCCACCTA CCT^TCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAGAG^AC^ 

Arr ap APPrT GTACGTTG A A GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
^S^T^^O^GAXia ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 
CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
Sg^GA CCCTG ACAAG GAG AATCAA A AGATCAGCTA CCGCATCCTG AG AGACCCAG 1500 
r IfififiTfinCT AGCCATGG AC CCAG ACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 
OTG AGG ATG A GC AGT^GTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
A^GGA^GCCC ?C^CACCACT GGCACGGGAA CCCrrCTGCT AACACTGATT GATGTCAACG 1680 
ACCATOGCCC AGTCCCTG AG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTCCGCC 1740 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 
TcSaGATGA CTCAG ACATC TACTGG ACGG CAGAGGTCAA CGAGGAAGGT <3 ACACAGTGG 1 860 
TCTTCTCOn^ j AAGAAG^TC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
AC^TGG^ 

Tr^rrrri r firTCTOCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 

?Sa^ 

AAGAGGGGGG TGGCGAAGAG GACCAGGACT A^AC ATCAC CCAGCTC 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC aaccatcatc c^acacu-a U.m 
^A^aS^GCC AACCCAGATG AAATCGGC^CT^AAT] T GAGAACCTGA 2340 
AGGCGGCTAA CACAG ACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
^^Scnn^rrn aPGCC GCGTCCCTGA GCTCCCTCAC CTOrTCCGCC TTCGACX^G 2460 
ACS™ CGATTATCTG AACGAGTGGG GCAGCCGCTI '^^^^^^O 
ACGGTGGCGG GG AGGACG AC TAGGCGGCCT GCCTGCAGGG GJGGGGAC^A ^rc^CTCk\&LQ 
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
GG AAGTGGCC GTAGCAACTT GGCGG AGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 
5^TgCCT TTCAGGATGG AGG AATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

SgggS 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTG ACTG AOC^CAGTC i G ^^^7 J880 
CTGGAATGGA ACCTTCTTAG CCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCT ATCT 2940 
TCAA AACGTT AG A^GAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTG^ATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT ^GCGT^ 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 
GAAGGGTGAG G ACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 

Seq ID NO: 126 Protein sequence: 
Protein Accession #: NP_001 784 

I 11 21 31 41 51 

' plfipiL s LLLLOVCWLO CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
SXSS^SS NPLFCIFPSKR ILRRHKRDWV VAPISVPENG 120 
KWP^W IFYSITGPGA DSPPEGVFAV BKETGWLL^^mAK 180 

YELFGHAVSE NGASVEDPMN ISI1VTDQND HKPKFTQDTF RGSVLEGVLF ^TSVMQVTAT 240 
tJEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK. VPEYTLTIQA 300 
SSSKt TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLTJ ^DAPNSP 360 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVUCL 420 
P^T^VH VEDVNEAPVF VPPSKVVEVQ EGtPTGEPVC VYTAEDPDKE NQKISYRILR 480 

«PVPEP RQIT1CNQSP VRHVLN1TDK DLSPHTSPFQ AQ^JPP^^^^^^^.^^-^gQ 
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP ™ 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED O^ITO^HR 720 
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFUE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession*: NMJJ03256.I 

Coding sequence: 60-734 (underlined sequences correspond to start and stop codons) 
I 11 21 31 41 51 

CCTGCTGGGG CCGTCC AGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 
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TGCCTGGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 1 20 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 180 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAGGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCA AACAG ATAAAGATGT 300 
TCAAAGGGTT TGAGAAAGTC AAGG ATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 360 
TCTGTGGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTG ATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA G AGGGAAAGT CTG AATCATC ACTACCATCT GAACTGTGGC TGCCAAATCA 540 
CCACCTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAGACT 600 
GGCTGTTGGA ACGAAAGCTC TATGGTTACC AGGCTCAGCA TTATGTCTGT ATGAAGCATG 660 
TTGACGGCAC CTG C AG CTG G TACCGGGGCC ACCTGCCTCT CAGGAAGGAG TTTGTTGACA 720 
TCGTTCAGCC CTAGTAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAGCTTTGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 
TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCQATC TGGTAGGGGA G AAATGGCG A 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1 140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 

Scq ID NO: 128 Protein sequence: 
Protein Accession #: NP_003247. 1 

1 11 21 31 41 51 

MPGSPRPAPS WVLLLRLLAL LRPPG LGEAC SCAPAHPQQH ICHSALV1RA KISSEKVVPA 60 
SADPADTEKM LRYEIKQDCM FKGFEKVKDV QYIYTPFDSS LCGVK LEANS QKQYLLTGQV 1 20 
LSDGKVFIHL CNYIEPWEDL SLVQRESLNH HYHLNCGCQ1 TTCYTVPCTI SAPNECLWTD 1 80 
WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 

Seq ID NO: 129 DNA sequence 

Nucleic Acid Accession #: NM 007207.2 

Coding sequence: 143-1 591 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CCACGCGTCC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
AGO AGCGCCA GATGGTGGAG GAATACACTT ATTTATGAAA CTGTCTTGAG TTCTTCTTGA 1 20 
ATTGCCAGTT TTCAGCCTCC T CATG CCTCC GTCTCCTTTA GACGACAGGG TAGTAGTGGC 1 80 
ACTATCTAGG CCCGTCCGAC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 
CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 
CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 
TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 
TGCTAACCAG ATGGTCAACA ATAATGAGAA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGGA CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAGAAGATG A CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
CATCATTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG GAGCTGTCCA 720 
CATTAACTGT GCCGATAAGA TCAGCCGGCG GAGACTGCAG CAGGGCAAGA TCACTGTCCT 780 
AGACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAGAAAT 840 
TATAGTTTAT GATG AGAATA CCAATG AACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC GAGTCCCTGA AGAGAGAAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 
TAGTAGTTTT AAGCAGAACC ATGAAAACCT CTGTGACAAC TCCCTCCAGC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 
CACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCTTG CCCTTCCTQT TCCTTGGCAA 1 140 
TGAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT CATCTTCCCC TCTACCACTA TGAGAAAGGC CTGTTCAACT A CAAGC GGCT 1260 
GCCAGCCACT GACAGCAACA AGCAGAACCT GCGGCAGTAC TTTGAAGAGG CTTTTGAGTT 1320 
CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 
CCGCTCCGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1440 
TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAOTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CGGTTGTG TG A CAATGGTCT GGATGGAAAG GATTGCTG CT 1620 
CTCCATTAGG AGACAATGAG GAAGGAGGAT GGATTCTGGT 111 11 lfCTl TCI 1 111! H 1680 
TTGTAGTTGG GAGTAAAGTT TGTGAATGGA AACAAACTTG GTTAAACACT TTATTTTTAA 1740 
CAAGTGTAAG AAGACTATAC TTTTGATGCC ATTGAGATTC ACCTTCCACA AACTGGCCAA 1800 
ATTAAGGAGG TTAAAGAAGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAATACAACT 1860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

Scq ID NO: 130 Protein sequence: 
Protein Accession 8: NPJTO9 1 3S. 1 

1 11 21 31 41 51 

MPPSPLDDRV VVALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPV1ATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQ AIAAG TTTTAIGTST TCPANQMVNN 1 20 
NENTGSLSPS SGVGSPVSGT PKQLASIKII YPNDLAKKMT KCSKSHLPSQ GPVIIDCRPF 180 

406 



WO 02/059377 
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MEYNKSHIQG AVHINCADKI SRRRLQQGKJ TVLDUSCRE GKDSFKRIFS ^VYD^T 240 
Srvmpso PLHIVLESLK REGKEPLVLK GGLSSFKQNH ENLCDNSLQL qecrevggc A 300 
S^SL^OP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VINVTTHLPL 360 
^^SSmmS^^^^^ FEFIEEAHQC GKGLL1HCQA GVSRSAT1VI 420 

w 

SeqlDNO:131 DNA sequence 
Nucleic Acid Accession #: NM_005409.3 
1 0 Coding sequence: 94-378 (underlined sequences correspond to start and stop codons) 

1 U 21 31 41 51 

' ' r | T GTTCAGCATT TCTACTCCTT CCAAGAAG AG CAGCAAAGCT GAAGTAGCAG 60 

1 S Sa^gcaccSc^ 

1 "> ^rncfaA TtaTTGTGTGC TA^ CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TO^GCA ^SSgG GGTAAAAGCA GTGAAAGTGG CAGATATTG A GAAAGCCTCC 240 
ISS^^C^^K^T^tOACMJi ATAGAAGTGA TTATTACCCT G AAAGAAAAT 300 
aY^c! AT ^GCCTA^ TOD^AATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

CTGCCCAAAG GAGTCCAGCA ATTAAATGG A mCTAGG^ 
9 ^ rrrrr Arr atp r.fiAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACAl 1L.A I «u 
GC^CT^S GCtXg^AAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTCAarAT 780 
n A^Sm mn^CTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

A (^TTTT GrTTTGATCC GTTTGTATAA ATG ATAGCAA TATCTTGGAC ACATTTGAAA 1260 
1 * ™Ja ?^TCT^ C^GAAAAA TGTTG AAAAA TAAGCAAATG TATACCTAGC 1320 

35 AATcl^S^ 

CTaTItACTG TAAAATTTAG OTATACTCAAG ^f/^^A 
GTCATTTTTT TCTCTA ATA A ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

40 s~i T n 1 17 PirHein sequence: 

Protein Accession #: NPJJ05400. 1 

I 11 21 31 41 51 
45 MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKAS1 MYPSNNCDKI 60 
EV11TLKENK GQRCLNPKSK QARUIKKVE RKNF 

Seq ID NO: 133 DNA sequence 

Nucleic Acid Accession it: NMJ) 12342 . . 

50 Coding sequence: 373-1 155 (underlined sequences correspond to start and stop coaonsj 

] n 21 31 41 51 

CTGGCGCGGG CGGGAGCTGC GGCGGATACC CTTGCGTGCT ^^9A^S^^^^^^tt^q jjO 

££r^nS£ SgctScGC CGAGCCGGGG CTCCGGAAGC CGGCGGGGGC GCCGCGGCCG 360 

rrfiTGCTGCT CACCAAAGGT GAAATTCGAT GCTACTGTGA TGCTGCCCAC 480 
T^r^^rrCA CT^G^GTT^T^T G^TGTAi^TCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
TGTGTAGCCA CTTCGTTATAT GTO 1 A TGGACTCTCT TGCAAGC ACG 600 

AGGGGTG AGG CCTCAGU ALA GAGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 

70 s«E£SSss=: 

GTGCCGGTCA ^rcG^nSn^S^C^^^^^ "TOT AC AGTGG GCACGGGAAG 1140 

CTCAGCAACG ATAAGATCCT ^^^^^ACTTA CTGAACAGCT TGAAGGCCTT 1200 
CTGGAATTCG JJ^f^S^rA^ rr aT*TTTATC TGAAG AC^^CTCATTTAAT CATCTTTGAG 1260 
TTG AGTTCTG CTGGACA^^A GC ACTTTA^ TCTGAAG G AT TATTTGCACA 1320 

75 AG ACAAAATG ACCTCTGCAA ACAG^OT^TAIJ^^ ATAAAAAGCA AAGAGAAGAC 1380 
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TAAACAAA AA TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA GATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Scq ID NO: 134 Protein sequence: 
Protein Accession V: NP_036474. 1 

I 11 21 31 41 51 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
SNSPLTHGCL DSLASTTDLC QAKQARNHSG 1TIPTLECCH EDMCNYRGLH DVLSPPRGEA 120 
SGQGNRYQHD GSRNUTKVQ ELTSSKELWF RAAVIAVPIA GGLILVLUM LALRMLRSEN 1 80 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
K1LSLVHWGM YSGHGKLEFV 

Seq ID NO: 1 35 DNA sequence 

Nucleic Acid Accession U: NMJJOI 627. 1 

Coding sequence: 64-1 8 1 5 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
AATATGGAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
ACCGTCTTCA GGCCAGGCCT TGG ATGGTAT ACTGTAAATT CAGCATATGG AGATACCATT 1 80 
ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG G AAATATGA A 240 
AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
TACGACGATG TACCAGAATA CAAAGACAGA TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGG ATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 
AACGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA ACCTGAAATT 480 
GTAAGCAAAG CACTGTTTCT CGAAACAGAG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 
GAAGACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCCC 600 
CTTGAAGGAG CGGTGGTCAT AA1 1 1 1 1 AAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TGCTCGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATTT ACTATCCTAC AGAGCAGGTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 
ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
GAATTTTTGT TTTACTTACC AGGACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA GACTACAAGT GTTCCCTGAT AGACAAAAAA 1020 
AGCATGATTG CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 
GGAGAAGTGA CTAGACAG AT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1 140 
AGGAATGCAA CTGTGGTATG GATGAAAGAT AAC ATCAG GC TTCGATCTAG CCOGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 
GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTGAAGAG 1500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAGA 1620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GCCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TGCATCAAAA 1740 
CATGTAAACA AGGACCTCGG TAATATGGAA GAAAACAAAA AGTTAGAAGA AAACAATCAC 1800 
AAAACTGAAG CCTAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC I860 
AATTGAAGCA TG AACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
■- TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGGTTTTCAG AGAATTATCT 1980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATG ATA 2040 
ATAGGTCATA TGTTGTGTTT GGTTCAATTT TTTTTCCGTA AATGTCTGCA CTGAGGATTT 2100 
CTTTTTGGTT TGCCTTTTAT GTAA ATTTTT TACGTAGCTA TTTTTATACA CTGTAAGCTT 2 1 60 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TGAGCAGGTA CATTTCTGAT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 
CCCTAATTAA AAACTGTGAT TTTTATCACA AGGG AGGGGA GGCCGAGAGT CAG ACTGATA 2400 
GACACCATAG GAGCCGACTC TTTGATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATA ATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

Seq ID NO: 136 Protein sequence: 
Protein Accession #: NP_00 1 6 18. 1 

1 11 21 31 41 51 

MESKGASSCR LLFCLLIS AT VFRPG LGWYT VNSAYGDTII IPCRLDVPQN LMFGKWKYEK 60 
PDGSPVFIAF RSSTKKSVQY DDVPEYKDRi. NLSENYTLSI SNARISDEKR FVCMLVTEDN 1 20 
VFEAPTIVKV FKQPSKPEIV SKALFLETEQ LKKLG DCISE DSYPDGNITW YRNGKVLHPL 1 80 
EGAWIIFKK EMDPVTQLYT MTSTLEYKTT KADI QMP FTC SVTYYGPSGQ KTIHSEQAVF 240 
DIYYPTEQVT IQVLPPKNAI KEGDNITLKC LGNGNPPPEE FLFYLPGQPE GIRSSNTYTL 300 
MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 
NATVVWMKDN IRLRSSPSFS SLH YQDAGNY VCETALQEVE GLKKRESLTL IVEGKPQDCM 420 
TKKTDPSGLS KT1ICHVEGF PKPAIQWTIT GSGSVINQTE ESPYINGRYY SKII1SPEEN 480 
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VTLTCTAENQ LERTVNSLNV SAISIPEHDE ADEISDENRE KVNDQAKLIV GIVVGLLLAA 
LVAGWYWLY MKKSKTASKH VNKDLGNMEE NKKLEENNHK TEA 



Scq ID NO: 137 DNA sequence 

Nucleic Acid Accession*: XM.030559 

Coding sequence: 1-119 (underlined sequences correspond to start and stop codons) 
1 II 21 31 41 51 

ATGAACCGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT IGGAGGTGAAG 60 
AGCAAGTTTG GAGCTGAATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC TGGMAATTT 120 
GAGGAGTTTT ATGGATTACT ACAACATGTT CATAAGATCC ^^^T^T^^^JP-T^??^^^ f? 0 
GGCTATGCAG ACATCCATGG AG ACTTACTA CCTATAAATA AT^ATGATAA T]^\^rArAP \m 
GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 300 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAACGTATTG 360 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 

TACAAATACG GCACGGAG AA ACCCCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 540 
GTAACACCAC ATGGCTTAG A AAAGGTTCCA GGG ATCTTTA TATCCAGGCT ^IP^^ppp ™ 
GGTCTGGCTC AAAGTACAGG ACTATTAGCT GTTAATGATG AAGTTTTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAAGAG CCTTGATCAA GTA ACAG ACA TG ATG ATTGC AAATAGCCGT 720 
AACCTCATCA TAACAGTGAGAOCGGCAAAC CAGAGGAATA ATGTTGTGAG GAACAGTCGG 780 

^SSagtocg^ 

GAACCAAGCT TTGAGCCAG A GGATGAAGAC AGCGAAGAAG ATGACATTAT CATTGAAGAC 900 
AATCGAGTC? CACAGCAGAT TCCAAAAGCT GTTCCTAATA ^^AG^jC(^ GCji^TCATTA 960 
ACACAGATAG AGCTAAGCTT TGAGTCTGGA CAGAATGGCT TTATTCCCTC TAATG AAGTG 1020 
AGCTTAGCAG C^ATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAGATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 

Scq ID NO: 138 Protein sequence: 
Protein Accession #: XP_030559 

1 11 21 31 41 51 

MNRSHRHG AG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGLLQHV HKIPNVDVLV 60 

RPDNHRKKPH IVISMPODFR PVSSIID VDI LPETHRRVRL YKYGTEKPLG FYIRDGSSVR 1 80 
toSvP GIF ISRLVPG GLAQSTGLLA VNDEVLEVNG IEVSGKSLDQ VTDJ^IANSR 240 
NUITVRPAN QRNNVVRNSRTSGSSGQSTD NSLLGYPQQ1 EPSFEPEDEDSEEDDUIED 300 
NG WOJPKA VPNTESLESL TQIELSFESG QNGFIPSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGT11TL 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
5 publications, sequences of accession numbers, and patent applications cited in this 

specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 



410 



WO 02/059377 PCT/US02/02242 

WHAT T£ CLAIMED IS: 

1 1 . A method of detecting a breast cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-25. 

1 2. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3. The method of claim 2, wherein the nucleic acids are mRNA. 

1 4. The method of claim 2, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 5. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-25. 

1 6. The method of claim 1, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 7. The method of claim 1, wherein the patient is undergoing a therapeutic 

2 regimen to treat breast cancer. 

1 8. The method of claim 1, wherein the patient is suspected of having 

2 breast cancer. 

1 9. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1 -25. 

1 1 0. The nucleic acid molecule of claim 9, which is labeled. 

1 1 1 . An expression vector comprising the nucleic acid of claim 9. 

1 12. Ahost cell comprising the expression vector of claim 11. 
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1 1 3 . An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-25. 

1 14. An antibody that specifically binds a polypeptide of claim 13. 

1 15. The antibody of claim 14, further conjugated to an effector component. 

1 16. The antibody of claim 15, wherein the effector component is a 

2 fluorescent label. 

1 17. The antibody of claim 15, wherein the effector component is a 

2 radioisotope or a cytotoxic chemical. 

1 18. The antibody of claim 15, which is an antibody fragment. 

1 19. The antibody of claim 1 5 , which is a humanized antibody 

1 20. A method of detecting a breast cancer cell in a biological sample from 

2 a patient, the method comprising contacting the biological sample with an antibody of claim 

3 14. 

1 21. The method of claim 20, wherein the antibody is further conjugated to 

2 an effector component. 

1 22. The method of claim 21, wherein the effector component is a 

2 fluorescent label. _ . . 

1 23. A method for identifying a compound that modulates a breast cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a breast cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1-25; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 
1 24. A drug screening assay comprising the steps of 
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2 (i) administering a test compound to a mammal having breast cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of breast cancer. 
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